Here's your real "deep state":
Google Research engineers have developed a deep learning system that separates individual voices in audio-visual recordings made in crowded environments, isolating a single speaker while several people talk at once. The only condition is that the speaker's face must be visible on screen, so the AI can match one of the voice tracks to that face and prioritize it over the rest.
Story here:
https://www.bleepingcomputer.com/new...es-in-a-crowd/