Hi everyone! I’m 15, and I’ve been working on this data visualization project for a while. I wanted to connect planets not by their physical distance in space, but by their actual physical characteristics (mass, star type, discovery method, etc.).

    I built a Python backend to fetch data from the NASA Exoplanet Archive, vectorized the features using Ollama, and pre-calculated the 3D coordinates in the background with NetworkX so the WebGL-based frontend doesn't freeze your browser.

    I am completely open to any ideas and suggestions on how I can take this project further. I would love to hear your valuable advice to help me learn and improve myself in this field.

    Source Code & Repo: https://github.com/pancodurden/astro-net

    by avariabase0

    12 Comments

    1. mglyptostroboides on

      Just a fair warning: People are gonna dogpile you for using an LLM. Even though it was used for an actually appropriate purpose here. Even though it was locally hosted. People have no nuance about the topic.

    2. What does your age have to do with anything? Did you think you’d get extra little brownie points for it?

    3. YttriumZirconium1 on

      Confused why you mention your age twice. Can’t find the relevance other than hoping for praise

    4. I can’t understand what I’m looking at at all and I’ve seen graphs showing expplanets organized by size mass distance from their star composition star type etc.

    5. Seems like a really creative project with a lot of different parts to learn. Keep it up!

    6. What am I looking at? For me, a good visualization should be conveying clear and concise information. Right now, even with your description of what you did, I could not extract any meaningful information about the exoplanets.

    7. DragonSnap506 on

      I’m confused why you call it an LLM when you aren’t using an LLM at all. Vector embeddings are something entirely different from an LLM. This project constructs a semantic similarity graph based on planets characteristics and doesn’t just use an LLM to “calculate” similarity.

      Also it seems you have a 0.92 similarity threshold but the template:
      f”Planet {pl_name} orbits star {hostname} of type {st_type}. Discovered in {discovery_year} using {method}” would give any two planets 85% similarity even if they shared nothing in common since the text string is similar.

      I think the project is fine in theory but it would be a lot more meaningful to not use a text embedding model and instead create a similarity graph based on the planets actual data. Typically embedding models are used when the underlying data is text. Here since you have structured data it might be better to directly compare fields to each other.

    8. killboticus89 on

      Could you make any part clickable? Or on hover an exoplanet point displays info? 

    9. otoko_no_hito on

      Okey, ngl this is cool… As a 33 year old senior engineer let me tell you this, man you have a gift at this, you might have had some help from an LLM and that’s fine, just the fact that you can think and understand this is pretty amazing! 

      The next logical step would be to categorize and build models that predict the likelihood of a system to have certain types of planets based on stellar composition and whatever else you might think as a variable.

    Leave A Reply