Data, Preprocessing, and Sources
Data Sources
The data for the stars, their locations, their spectral types, and other information comes from the Augmented Tycho - HYG (AT-HYG) dataset. The majority of the stars seen in this visualization are from the HIPPARCOS dataset specifically.
Constellation shapes and skycultures are from Stellarium.
Exoplanet data was taken from NASA’s Exoplanet Archive
Pre-Processing
Preprocessing for the data was done in R. You can access the R scripts here. Generally, the following processing was done:
The processed data files are placed in the Unity project in the /Assets/Resources
folder.
- The full set of stars is foud in the
athyg_v31_cleaned_exo_all_stars.csv
file
- Each constellation set has a
constellationship_<set_name>.txt
and names\<set_name>_constellation_names.txt
file associated to the get the links and names of each constellation in that set.
Processing
There are several operations performed in the Unity application for data handling.
- Scale
- The (x,y,z) and (vx,vy,vz) are initially given in parsecs. These are read into Unity 1:1. Unity units default to meters, so, in the shader program, all vertex values are multiplied by a scaling factor of
0.30479999024
to convert from meters/parsecs to feet.
- The velocity of each star is given in km/sec. This value is converted in the shader program to parsec/year by multiplying by
0.00000102269
.
- Color
- The spectral type color scale mostly follows the Harvard spectral classification, with some notable expections.
- White Dwarfs, noted in the HIPPARCOS dataset as “D”, are drawn in the same color as type O stars, but with a solar radius of
0.1
.
- Subdwarfs, noted in the HIPPARCOS dataset as “s”, are grouped with spectral type G.
- Cool Giants are, noted in the HIPPARCOS dataset as “R”, “N”, and “S”, are grouped with spectral type M.
- Carbon stars and red giants are given the new classifcation “C”.
- Wolf-Rayet stars, noted in the HIPPARCOS dataset as “W”, are grouped with spectral type O.
- Peculiar objects, including unique stars, some nebulae, and some galaxies, are given the spectal type P.
- All stars are given a static size based on this spectral classification.
- Constellations
- Some constellations are made of stars that are missing data in the HIPPARCOS dataset. In these cases, the line from the constellation that is missing one or more stars is omitted.
- Exoplanet colors
- Stars without exoplanets in the archive are shown in dark gray.
- The number of discovered exoplanets for each star is shown with an increasingly bright color, with the maximum number, 6, shown in bright yellow.
Other Sources
Big Dipper Highlight
- Korean
- Arabic
- Chinese
- Sami
Art and Sound
Code
- Though not used, the PCX package for Unity was referenced to understand shaders.
App Features • Data, Processing, and Sources • Code • Development Details • Optimization Details