Machine Learning Method Provides New Photometric Survey Classification - Chinese Academy of Sciences

Mar 30, 2022

The Javalambre Photometric Local Universe Survey (J-PLUS), conducted at the Observatorio Astrofísico de Javalambre (OAJ) in Teruel, Spain, uses an 83 cm telescope to build a powerful 3D view of the nearby universe. J-PLUS provides photometric data in 12 passbands designed to extract spectral features.

A recent study led by WANG Cunshi, a PhD candidate at the National Astronomical Observatories of the Chinese Academy of Sciences (NAOC), applied a Support Vector Machine (SVM) algorithm to classify the objects in the J-PLUS first data release (DR1) catalog as stars, galaxies, or quasars.

The study was published in Astronomy & Astrophysics on March 18.

Machine learning, which draws on statistics, optimization, and computer science, builds algorithms that learn to process data from a well-chosen sample set with features. It can help uncover hidden patterns in large datasets. SVM is a classification method that divides objects with a hyperplane, chosen according to the distance between the hyperplane and the objects nearest to it.
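To make the idea concrete, here is a minimal sketch of a three-class SVM, assuming scikit-learn's `SVC` with a linear kernel. The data are synthetic stand-ins for 12-band magnitudes, not J-PLUS measurements, and the cluster centers, kernel, and sample sizes are illustrative assumptions rather than the settings used in the study.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
labels = ["star", "galaxy", "quasar"]
n_bands = 12  # one feature per passband

# Synthetic "magnitudes": one Gaussian cluster per class (assumed values, not real data).
centers = np.array([[16.0] * n_bands, [19.0] * n_bands, [22.0] * n_bands])
X = np.vstack([c + 0.3 * rng.standard_normal((50, n_bands)) for c in centers])
y = np.repeat(labels, 50)

# A linear kernel keeps the decision boundaries as plain hyperplanes.
clf = SVC(kernel="linear")
clf.fit(X, y)

# Classify one object placed at each cluster center.
predicted = clf.predict(centers)
print(list(predicted))
```

With clusters this well separated, the classifier should recover the label of each center; real photometric classes overlap far more, which is why the choice of training sample matters.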

Because of the point spread in J-PLUS detections, stars are hard to distinguish from other objects, so the researchers classified the sources to make the catalog more convenient for later studies.

The training sample was drawn from spectroscopic surveys, including the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST), the Sloan Digital Sky Survey (SDSS), and VERONCAT, the Veron Catalog of Quasars & AGN (VV13). Spectroscopic data carry more information than photometric observations and are more precise. The SVM model that uses magnitudes as features was chosen for its high accuracy.

Machine-learning algorithms are more accurate when an object falls within the sample-dense region of the feature space. The researchers constructed twelve 3D density contours to approximate that region and divided the J-PLUS DR1 catalog into interpolation and extrapolation subsets.
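The article does not describe how the contours are built, so the following is only a rough sketch of the general idea, assuming a single histogram density grid in a hypothetical 3D feature space. The function names, the bin count, and the `min_count` threshold are illustrative assumptions, not the researchers' method.

```python
import numpy as np

def fit_density_grid(train, bins=8):
    """Histogram the training sample on a regular 3D grid."""
    counts, edges = np.histogramdd(train, bins=bins)
    return counts, edges

def is_interpolation(points, counts, edges, min_count=1):
    """True where a point lies in a grid cell holding at least min_count training objects."""
    inside = np.ones(len(points), dtype=bool)
    idx = []
    for d, e in enumerate(edges):
        i = np.searchsorted(e, points[:, d], side="right") - 1
        # The right-most edge belongs to the last bin, as in np.histogramdd.
        i[points[:, d] == e[-1]] = len(e) - 2
        inside &= (i >= 0) & (i <= len(e) - 2)
        idx.append(np.clip(i, 0, len(e) - 2))
    dense = counts[tuple(idx)] >= min_count
    return inside & dense

rng = np.random.default_rng(0)
train = rng.standard_normal((2000, 3))  # synthetic training sample
counts, edges = fit_density_grid(train)

# One object in the dense core, one far outside the training sample.
catalog = np.array([[0.0, 0.0, 0.0], [10.0, 10.0, 10.0]])
flags = is_interpolation(catalog, counts, edges, min_count=3)
print(flags)
```

Objects flagged `True` would be treated as interpolation and the rest as extrapolation, where predictions deserve less trust.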

A blind test, used to validate the algorithm, showed that the accuracy for the interpolation subset was 96.50%, with stars reaching the highest accuracy at 99.27%. The accuracy for the extrapolation subset dropped to 79.1%, where the magnitude distributions of galaxies and quasars differed from those of the sample set.
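As a small illustration of how such figures can be computed, the sketch below scores predictions against held-out true labels. It assumes that per-class accuracy means the fraction of objects of a given true class that were labeled correctly; the article does not spell out its definition, and the labels here are toy values.

```python
from collections import Counter

def blind_test_accuracy(true_labels, predicted_labels):
    """Return overall accuracy and the per-class fraction of correct predictions."""
    total = Counter(true_labels)
    correct = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    overall = sum(correct.values()) / len(true_labels)
    per_class = {label: correct[label] / n for label, n in total.items()}
    return overall, per_class

# Toy labels for illustration only.
true_labels = ["star", "star", "star", "galaxy", "galaxy", "quasar"]
predicted_labels = ["star", "star", "star", "galaxy", "quasar", "quasar"]

overall, per_class = blind_test_accuracy(true_labels, predicted_labels)
print(overall, per_class)
```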

The classification also turned up some abnormal objects. These are difficult to assign to any label, meaning their predicted probabilities are roughly equal across the labels. The researchers computed the Mahalanobis distance of such objects to each label and listed 26 abnormal objects.
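The two ingredients mentioned here can be sketched as follows, assuming NumPy. The Mahalanobis distance is the standard one, computed from a class's sample mean and covariance; the `is_ambiguous` helper and its `tolerance` are hypothetical choices for "roughly equal" probabilities, not the criterion used in the study.

```python
import numpy as np

def mahalanobis_to_class(x, samples):
    """Mahalanobis distance from x to the mean of a class, using the class covariance."""
    mean = samples.mean(axis=0)
    cov = np.cov(samples, rowvar=False)
    diff = x - mean
    return float(np.sqrt(diff @ np.linalg.solve(cov, diff)))

def is_ambiguous(probabilities, tolerance=0.1):
    """True when the class probabilities are within tolerance of one another."""
    p = np.asarray(probabilities)
    return bool(p.max() - p.min() < tolerance)

# Toy class sample in two features (illustrative values only).
class_sample = np.array([[0.0, 0.0], [2.0, 0.0], [0.0, 2.0], [2.0, 2.0]])
distance = mahalanobis_to_class(np.array([3.0, 1.0]), class_sample)

print(distance)
print(is_ambiguous([0.34, 0.33, 0.33]), is_ambiguous([0.90, 0.05, 0.05]))
```

An object that is ambiguous in probability and also far, in Mahalanobis terms, from every class would be a natural candidate for an abnormal-object list.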

This work will supplement the J-PLUS catalog and provide a new method for screening abnormal objects. The contour method also offers a new way to control prediction uncertainty.