1. Transferability of audio adversarial attacks - Community members question whether adversarial audio tricks that work on open models carry over to widely‑used ASR systems like Whisper.
"Does this transfer to Whisper / CLAP-type audio models or is it ASR‑decoder specific?" – leonulicnik
"In general, if you zoom all the way out, yes the high level optimization problem is very similar..." – dijksterhuis
"Yeah, there have been several papers with attacks on Whisper:" – woodson 2. Outlook on the vulnerability landscape
- Debate over whether defenders or attackers will have the long‑term edge as LLMs mature, with some arguing that the supply of bugs is effectively limitless.
"My feeling is the defender wins in the long‑run. There's only a finite number of bugs and vulnerabilities." – energy123
"I doubt you can prove that." – jeffbee
"Vulnerabilities are perpetually being created..." – root_axis
3. AI‑generated supplemental audio tracks on video platforms
- Several users point out a recurring pattern of extra commentary‑style audio tracks on short‑form videos, apparently added to evade automated copyright takedowns.
"I'd guess it's more a way to avoid YouTube's copyright detection/etc rather than AI scraping per se." – tikhonj