This is how Shazam works.

March 21, 2026

by Jealous-Mirror-4576

View 16 Comments

16 Comments

Bronze000 on March 21, 2026 4:49 am

Well , that’s Shazam
thepoylanthropist on March 21, 2026 4:54 am

![gif](giphy|tIzYvGsndYZu9yO0Pv)

Me:
The_Dank_Tortuga on March 21, 2026 4:55 am

Hence why it NEVER FREAKIN WORKS
zagomyego on March 21, 2026 5:04 am

Rollin Cali weed in a zig zag paper
Dipswitch_512 on March 21, 2026 5:11 am

Sure but it still sells your data
dh2513 on March 21, 2026 5:12 am

well I’ll be damned
danfay222 on March 21, 2026 5:12 am

Of all of the things to use to represent the FFT, Shazam is not the one I would pick
Opulent-tortoise on March 21, 2026 5:14 am

This video is AI slop. It doesn’t explain the algorithm at all other than that it uses FFT. It’s clearly just a manim animation generated by an LLM with a text-to-speech script
Roni1209 on March 21, 2026 5:23 am

That shit never works lmao
Famous_Cow9640 on March 21, 2026 5:24 am

I mean, this is so dumbed down it’s ridic. This would take a while, it seems, to do the comparison of the FFT results against all the songs or pieces of songs. Yet this typically only takes seconds to get a result, what are they leaving out? There has to be some way that they narrow the choices that they are going to compare the FFT results against. I really doubt they are just blindly, brute force, going through all the songs the same way each time a song section is sampled
nexxlevelgames on March 21, 2026 5:28 am

its looking for a key
King_K_24 on March 21, 2026 5:37 am

It doesn’t listen to your music… is just listens to your music and the represents it in a way it understands?
rubiksalgorithms on March 21, 2026 5:38 am

Wow so calculus actually has an application in the real world
Iam_The_Real_Fake on March 21, 2026 5:52 am

![gif](giphy|3iBcMfGoHJ6KNplykY|downsized)
DuckCleaning on March 21, 2026 5:52 am

>”Shazam never actually listens to your music”

It is listening to the music and breaking down the frequencies.
Pondering_Moose on March 21, 2026 5:54 am

For anyone curious this is outdated and not how modern systems work, with the rise of nueral nets (yes obviously more ai bs), the output of the fft is usually used for a model to generate an embedding, a numerical representation of the sound to understand and map songs in a near identical way to how language models like chatgpt map from your question to the words it selects to respond with