Of all of the things to use to represent the FFT, Shazam is not the one I would pick
Opulent-tortoise on
This video is AI slop. It doesn’t explain the algorithm at all other than that it uses FFT. It’s clearly just a manim animation generated by an LLM with a text-to-speech script
Roni1209 on
That shit never works lmao
Famous_Cow9640 on
I mean, this is so dumbed down it’s ridic. It seems like comparing the FFT results against all the songs, or pieces of songs, would take a while. Yet it typically only takes seconds to get a result. What are they leaving out? There has to be some way that they narrow down the choices they compare the FFT results against. I really doubt they are just blindly brute-forcing through all the songs the same way each time a song section is sampled
nexxlevelgames on
It’s looking for a key
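To make the "narrowing" and "key" idea concrete, here is a minimal sketch of a hash-key lookup, assuming the spectrogram peaks have already been extracted from the FFT output. It follows the general shape of peak-pair fingerprinting as it is commonly described, not any confirmed production implementation. The toy `SONGS` data and the names `fingerprints`, `build_index`, and `identify` are all hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical toy catalog: each song is a list of (time_index, peak_frequency_hz)
# spectrogram peaks. A real system would extract these from FFT frames.
SONGS = {
    "song_a": [(0, 440), (1, 660), (2, 880), (3, 660), (4, 550)],
    "song_b": [(0, 300), (1, 450), (2, 300), (3, 600), (4, 750)],
}


def fingerprints(peaks, fan_out=2):
    """Pair each peak with the next few peaks and yield (key, anchor_time).

    The key (f1, f2, time_delta) does not depend on where the clip starts.
    """
    for i, (t1, f1) in enumerate(peaks):
        for t2, f2 in peaks[i + 1 : i + 1 + fan_out]:
            yield (f1, f2, t2 - t1), t1


def build_index(songs):
    """Map every fingerprint key to the songs (and times) where it occurs."""
    index = defaultdict(list)
    for name, peaks in songs.items():
        for key, t in fingerprints(peaks):
            index[key].append((name, t))
    return index


def identify(index, sample_peaks):
    """Look up each sample key directly and vote on (song, time offset)."""
    votes = Counter()
    for key, t_sample in fingerprints(sample_peaks):
        for name, t_song in index.get(key, ()):
            votes[(name, t_song - t_sample)] += 1
    if not votes:
        return None
    (name, _offset), _count = votes.most_common(1)[0]
    return name


INDEX = build_index(SONGS)

# A clip taken from the middle of song_a; its own clock starts at 0.
clip = [(0, 880), (1, 660), (2, 550)]
print(identify(INDEX, clip))
```

The point of the sketch is that no song is ever compared end to end. Each key is a dictionary lookup, so only songs that share a key with the clip are touched at all, and a consistent time offset across several keys is what singles out the match.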
King_K_24 on
It doesn’t listen to your music… it just listens to your music and then represents it in a way it understands?
rubiksalgorithms on
Wow so calculus actually has an application in the real world
Iam_The_Real_Fake on

DuckCleaning on
> “Shazam never actually listens to your music”
It is listening to the music and breaking down the frequencies.
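For what "breaking down the frequencies" means in practice, here is a small sketch using a naive discrete Fourier transform in plain Python. An FFT computes the same result, only faster. The sample rate, the test signal, and the name `dft_magnitudes` are assumptions made up for illustration.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Naive O(n^2) DFT; returns magnitudes for bins 0 .. n/2 - 1."""
    n = len(samples)
    mags = []
    for k in range(n // 2):
        acc = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        mags.append(abs(acc))
    return mags


SAMPLE_RATE = 64  # samples per second (hypothetical, chosen so 1 bin = 1 Hz)
N = 64            # number of samples, i.e. one second of "audio"

# Test signal: a strong 5 Hz tone plus a weaker 12 Hz tone.
signal = [
    math.sin(2 * math.pi * 5 * t / SAMPLE_RATE)
    + 0.5 * math.sin(2 * math.pi * 12 * t / SAMPLE_RATE)
    for t in range(N)
]

mags = dft_magnitudes(signal)
dominant_bin = max(range(len(mags)), key=mags.__getitem__)
dominant_hz = dominant_bin * SAMPLE_RATE / N
print(dominant_hz)
```

The input is just a list of amplitude samples from the microphone. The output tells you how much of each frequency is present, which is the raw material a fingerprint is built from.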
Pondering_Moose on
For anyone curious, this is outdated and not how modern systems work. With the rise of neural nets (yes, obviously more AI BS), the output of the FFT is usually fed to a model that generates an embedding, a numerical representation of the sound. That embedding is used to understand and map songs in a way nearly identical to how language models like ChatGPT map from your question to the words they select to respond with
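A rough sketch of the embedding idea, assuming a model has already turned each song and the recorded clip into a fixed-length vector. The vectors below are invented, and `cosine_similarity` and `nearest_song` are hypothetical helper names. Whether any particular product actually works this way is not something this example can confirm.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical embeddings; a real model would output far more dimensions.
CATALOG = {
    "song_a": [0.9, 0.1, 0.3],
    "song_b": [0.1, 0.8, 0.5],
    "song_c": [0.4, 0.4, 0.9],
}


def nearest_song(query, catalog):
    """Return the catalog entry whose embedding is most similar to the query."""
    return max(catalog, key=lambda name: cosine_similarity(query, catalog[name]))


# Pretend this came from running the model on a noisy recording of song_a.
query = [0.85, 0.15, 0.25]
print(nearest_song(query, CATALOG))
```

This version scans the whole catalog, which is fine for three songs. At scale, the linear scan would normally be replaced by an approximate nearest-neighbor index.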
16 Comments
Well, that’s Shazam

Me:
Hence why it NEVER FREAKIN WORKS
Rollin Cali weed in a zig zag paper
Sure but it still sells your data
well I’ll be damned