Inferring properties of dust in supernovae with neural networks

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

Inferring properties of dust in supernovae with neural networks. / Ansari, Zoe; Gall, Christa; Wesson, Roger; Krause, Oswin.

In: Astronomy & Astrophysics, Vol. 666, A176, 25.10.2022.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Ansari, Z, Gall, C, Wesson, R & Krause, O 2022, 'Inferring properties of dust in supernovae with neural networks', Astronomy & Astrophysics, vol. 666, A176. https://doi.org/10.1051/0004-6361/202243078

APA

Ansari, Z., Gall, C., Wesson, R., & Krause, O. (2022). Inferring properties of dust in supernovae with neural networks. Astronomy & Astrophysics, 666, [A176]. https://doi.org/10.1051/0004-6361/202243078

Vancouver

Ansari Z, Gall C, Wesson R, Krause O. Inferring properties of dust in supernovae with neural networks. Astronomy & Astrophysics. 2022 Oct 25;666. A176. https://doi.org/10.1051/0004-6361/202243078

Author

Ansari, Zoe ; Gall, Christa ; Wesson, Roger ; Krause, Oswin. / Inferring properties of dust in supernovae with neural networks. In: Astronomy & Astrophysics. 2022 ; Vol. 666.

Bibtex

@article{2b241fbf0b99461fa2237bc0c785e734,
title = "Inferring properties of dust in supernovae with neural networks",
abstract = "Context. Determining properties of dust that formed in and around supernovae from observations remains challenging. This may be due to either incomplete coverage of data in wavelength or time, but also due to often inconspicuous signatures of dust in the observed data.Aims. Here we address this challenge using modern machine learning methods to determine the amount and temperature of dust as well as its composition from a large set of simulated data. We aim to quantify if such methods are suitable to infer quantities and properties of dust from future observations of supernovae.Methods. We developed a neural network consisting of eight fully connected layers and an output layer with specified activation functions that allowed us to predict the dust mass, temperature, and composition as well as their respective uncertainties for each single supernova of a large set of simulated supernova spectral energy distributions (SEDs). We produced the large set of supernova SEDs for a wide range of different supernovae and dust properties using the advanced, fully three-dimensional radiative transfer code MOCASSIN. We then convolved each SED with the entire suite of James Webb Space Telescope (JWST) bandpass filters to synthesise a photometric data set. We split this data set into three subsets which were used to train, validate, and test the neural network. To find out how accurately the neural network can predict the dust mass, temperature, and composition from the simulated data, we considered three different scenarios. First, we adopted a uniform distance of similar to 0.43 Mpc for all simulated SEDs. Next we uniformly distributed all simulated SEDs within a volume of 0.43-65 Mpc and, finally, we artificially added random noise corresponding to a photometric uncertainty of 0.1 mag. Lastly, we conducted a feature importance analysis via SHapley Additive exPlanations (SHAP) to find the minimum set of JWST bandpass filters required to predict the selected dust quantities with an accuracy that is comparable to standard methods in the literature.Results. We find that our neural network performs best for the scenario in which all SEDs are at the same distance and for a minimum subset of seven JWST bandpass filters within a wavelength range 3-25 mu m. This results in rather small root-mean-square errors (RMSEs) of similar to 0.08 dex and similar to 42 K for the most reliable predicted dust masses and temperatures, respectively. For the scenario in which SEDs are distributed out to 65 Mpc and contain synthetic noise, the most reliable predicted dust masses and temperatures achieve an RMSE of similar to 0.12 dex and similar to 38 K, respectively. Thus, in all scenarios, both predicted dust quantities have smaller predicted uncertainties compared to those in the literature achieved with common SED fitting methods of actual observations of supernovae. Moreover, our neural network can well distinguish between the different dust species included in our work, reaching a classification accuracy of up to 95% for carbon and 99% for silicate dust.Conclusions. Although we trained, validated, and tested our neural network entirely on simulated SEDs, our analysis shows that a suite of JWST bandpass filters containing NIRCam F070W, F140M, F356W and F480M as well as MIRI F560W, F770W, F1000W, F1130W, F1500W, and F1800W filters are likely the most important filters needed to derive the quantities and determine the properties of dust that formed in and around supernovae from future observations. We tested this on selected optical to infrared data of SN 1987A at 615 days past explosion and find good agreement with dust masses and temperatures inferred with standard fitting methods in the literature.",
keywords = "galaxies: star formation, methods: statistical, supernovae: general, SN 1987A, EMISSION, REMNANT, PHOTOIONIZATION, EJECTA, CONSTRAINTS, MOCASSIN, GROWTH, GRAINS, GALAXY",
author = "Zoe Ansari and Christa Gall and Roger Wesson and Oswin Krause",
year = "2022",
month = oct,
day = "25",
doi = "10.1051/0004-6361/202243078",
language = "English",
volume = "666",
journal = "Astronomy & Astrophysics",
issn = "0004-6361",
publisher = "E D P Sciences",

}

RIS

TY - JOUR

T1 - Inferring properties of dust in supernovae with neural networks

AU - Ansari, Zoe

AU - Gall, Christa

AU - Wesson, Roger

AU - Krause, Oswin

PY - 2022/10/25

Y1 - 2022/10/25

N2 - Context. Determining properties of dust that formed in and around supernovae from observations remains challenging. This may be due to either incomplete coverage of data in wavelength or time, but also due to often inconspicuous signatures of dust in the observed data.Aims. Here we address this challenge using modern machine learning methods to determine the amount and temperature of dust as well as its composition from a large set of simulated data. We aim to quantify if such methods are suitable to infer quantities and properties of dust from future observations of supernovae.Methods. We developed a neural network consisting of eight fully connected layers and an output layer with specified activation functions that allowed us to predict the dust mass, temperature, and composition as well as their respective uncertainties for each single supernova of a large set of simulated supernova spectral energy distributions (SEDs). We produced the large set of supernova SEDs for a wide range of different supernovae and dust properties using the advanced, fully three-dimensional radiative transfer code MOCASSIN. We then convolved each SED with the entire suite of James Webb Space Telescope (JWST) bandpass filters to synthesise a photometric data set. We split this data set into three subsets which were used to train, validate, and test the neural network. To find out how accurately the neural network can predict the dust mass, temperature, and composition from the simulated data, we considered three different scenarios. First, we adopted a uniform distance of similar to 0.43 Mpc for all simulated SEDs. Next we uniformly distributed all simulated SEDs within a volume of 0.43-65 Mpc and, finally, we artificially added random noise corresponding to a photometric uncertainty of 0.1 mag. Lastly, we conducted a feature importance analysis via SHapley Additive exPlanations (SHAP) to find the minimum set of JWST bandpass filters required to predict the selected dust quantities with an accuracy that is comparable to standard methods in the literature.Results. We find that our neural network performs best for the scenario in which all SEDs are at the same distance and for a minimum subset of seven JWST bandpass filters within a wavelength range 3-25 mu m. This results in rather small root-mean-square errors (RMSEs) of similar to 0.08 dex and similar to 42 K for the most reliable predicted dust masses and temperatures, respectively. For the scenario in which SEDs are distributed out to 65 Mpc and contain synthetic noise, the most reliable predicted dust masses and temperatures achieve an RMSE of similar to 0.12 dex and similar to 38 K, respectively. Thus, in all scenarios, both predicted dust quantities have smaller predicted uncertainties compared to those in the literature achieved with common SED fitting methods of actual observations of supernovae. Moreover, our neural network can well distinguish between the different dust species included in our work, reaching a classification accuracy of up to 95% for carbon and 99% for silicate dust.Conclusions. Although we trained, validated, and tested our neural network entirely on simulated SEDs, our analysis shows that a suite of JWST bandpass filters containing NIRCam F070W, F140M, F356W and F480M as well as MIRI F560W, F770W, F1000W, F1130W, F1500W, and F1800W filters are likely the most important filters needed to derive the quantities and determine the properties of dust that formed in and around supernovae from future observations. We tested this on selected optical to infrared data of SN 1987A at 615 days past explosion and find good agreement with dust masses and temperatures inferred with standard fitting methods in the literature.

AB - Context. Determining properties of dust that formed in and around supernovae from observations remains challenging. This may be due to either incomplete coverage of data in wavelength or time, but also due to often inconspicuous signatures of dust in the observed data.Aims. Here we address this challenge using modern machine learning methods to determine the amount and temperature of dust as well as its composition from a large set of simulated data. We aim to quantify if such methods are suitable to infer quantities and properties of dust from future observations of supernovae.Methods. We developed a neural network consisting of eight fully connected layers and an output layer with specified activation functions that allowed us to predict the dust mass, temperature, and composition as well as their respective uncertainties for each single supernova of a large set of simulated supernova spectral energy distributions (SEDs). We produced the large set of supernova SEDs for a wide range of different supernovae and dust properties using the advanced, fully three-dimensional radiative transfer code MOCASSIN. We then convolved each SED with the entire suite of James Webb Space Telescope (JWST) bandpass filters to synthesise a photometric data set. We split this data set into three subsets which were used to train, validate, and test the neural network. To find out how accurately the neural network can predict the dust mass, temperature, and composition from the simulated data, we considered three different scenarios. First, we adopted a uniform distance of similar to 0.43 Mpc for all simulated SEDs. Next we uniformly distributed all simulated SEDs within a volume of 0.43-65 Mpc and, finally, we artificially added random noise corresponding to a photometric uncertainty of 0.1 mag. Lastly, we conducted a feature importance analysis via SHapley Additive exPlanations (SHAP) to find the minimum set of JWST bandpass filters required to predict the selected dust quantities with an accuracy that is comparable to standard methods in the literature.Results. We find that our neural network performs best for the scenario in which all SEDs are at the same distance and for a minimum subset of seven JWST bandpass filters within a wavelength range 3-25 mu m. This results in rather small root-mean-square errors (RMSEs) of similar to 0.08 dex and similar to 42 K for the most reliable predicted dust masses and temperatures, respectively. For the scenario in which SEDs are distributed out to 65 Mpc and contain synthetic noise, the most reliable predicted dust masses and temperatures achieve an RMSE of similar to 0.12 dex and similar to 38 K, respectively. Thus, in all scenarios, both predicted dust quantities have smaller predicted uncertainties compared to those in the literature achieved with common SED fitting methods of actual observations of supernovae. Moreover, our neural network can well distinguish between the different dust species included in our work, reaching a classification accuracy of up to 95% for carbon and 99% for silicate dust.Conclusions. Although we trained, validated, and tested our neural network entirely on simulated SEDs, our analysis shows that a suite of JWST bandpass filters containing NIRCam F070W, F140M, F356W and F480M as well as MIRI F560W, F770W, F1000W, F1130W, F1500W, and F1800W filters are likely the most important filters needed to derive the quantities and determine the properties of dust that formed in and around supernovae from future observations. We tested this on selected optical to infrared data of SN 1987A at 615 days past explosion and find good agreement with dust masses and temperatures inferred with standard fitting methods in the literature.

KW - galaxies: star formation

KW - methods: statistical

KW - supernovae: general

KW - SN 1987A

KW - EMISSION

KW - REMNANT

KW - PHOTOIONIZATION

KW - EJECTA

KW - CONSTRAINTS

KW - MOCASSIN

KW - GROWTH

KW - GRAINS

KW - GALAXY

U2 - 10.1051/0004-6361/202243078

DO - 10.1051/0004-6361/202243078

M3 - Journal article

VL - 666

JO - Astronomy & Astrophysics

JF - Astronomy & Astrophysics

SN - 0004-6361

M1 - A176

ER -

ID: 324552836