This is how Shazam works.



    by Jealous-Mirror-4576

    16 Comments

    1. Of all of the things to use to represent the FFT, Shazam is not the one I would pick

    2. Opulent-tortoise on

      This video is AI slop. It doesn’t explain the algorithm at all, other than saying that it uses the FFT. It’s clearly just a Manim animation generated by an LLM with a text-to-speech script.

    3. Famous_Cow9640 on

      I mean, this is so dumbed down it’s ridiculous. Comparing the FFT results against every song, or every piece of every song, would seemingly take a while, yet a result typically comes back in seconds. What are they leaving out? There has to be some way they narrow down the candidates they compare the FFT results against. I really doubt they blindly brute-force through all the songs the same way each time a song section is sampled.

    4. It doesn’t listen to your music… it just listens to your music and then represents it in a way it understands?

    5. DuckCleaning on

      >“Shazam never actually listens to your music”

      It is listening to the music and breaking down the frequencies.

    6. Pondering_Moose on

      For anyone curious, this is outdated and not how modern systems work. With the rise of neural nets (yes, obviously more AI BS), the output of the FFT is usually fed to a model that generates an embedding: a numerical representation of the sound, used to understand and map songs in a nearly identical way to how language models like ChatGPT map from your question to the words they select to respond with.
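
On the question raised in comment 3 about how a match can come back in seconds: the published description of the classic approach does not compare FFT output against every song. It hashes pairs of spectrogram peaks and looks them up in an index, then votes on a consistent time offset. Below is a minimal Python sketch of that idea. The peak data is made up, and the names `fingerprints`, `build_index`, `identify`, and the `fan_out` parameter are illustrative assumptions, not Shazam's actual code.

```python
from collections import Counter, defaultdict

def fingerprints(peaks, fan_out=3):
    """Pair each peak with the next few peaks; yield (hash_key, anchor_time).

    `peaks` is a list of (time_index, frequency_bin) tuples. In a real system
    these would come from local maxima of an FFT spectrogram (assumption:
    that step is skipped here and the peaks are given directly).
    """
    peaks = sorted(peaks)
    for i, (t1, f1) in enumerate(peaks):
        for t2, f2 in peaks[i + 1 : i + 1 + fan_out]:
            # The key depends only on the two frequencies and the time gap,
            # so it is the same wherever in the song the sample starts.
            yield (f1, f2, t2 - t1), t1

def build_index(songs):
    """Map every hash key to the (song_id, time) positions where it occurs."""
    index = defaultdict(list)
    for song_id, peaks in songs.items():
        for key, t in fingerprints(peaks):
            index[key].append((song_id, t))
    return index

def identify(index, sample_peaks):
    """Return the song whose hashes line up at one consistent time offset."""
    votes = Counter()
    for key, t_sample in fingerprints(sample_peaks):
        # Dictionary lookup: only songs sharing this exact hash are touched.
        for song_id, t_song in index.get(key, ()):
            votes[(song_id, t_song - t_sample)] += 1
    if not votes:
        return None
    (song_id, _offset), _count = votes.most_common(1)[0]
    return song_id

# Hypothetical toy catalogue of two "songs".
songs = {
    "song_a": [(0, 10), (1, 25), (2, 40), (3, 15), (4, 30), (5, 55)],
    "song_b": [(0, 12), (1, 33), (2, 8), (3, 47), (4, 21), (5, 60)],
}
index = build_index(songs)

# A sample recorded two time steps into song_b, with its own clock starting at 0.
sample = [(0, 8), (1, 47), (2, 21), (3, 60)]
result = identify(index, sample)
print(result)
```

The cost of a query here depends on how many hashes the sample produces and how many index entries share them, not on the total number of songs, which is one plausible answer to why lookups are fast. The embedding-based systems described in comment 6 would replace the exact hash lookup with a nearest-neighbour search over vectors.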
