深度学习自动分类器：XRD数据利用

测定固体和液体材料的晶体结构对于理解它们的机械、电磁和热力学性质是十分重要的。粉末X射线衍射（XRD）是材料表征的重要手段，它编码了关于晶体对称性、晶格参数、类型以及纳米级上原子的填充信息。然而，现在的分类方法需要大量的人为干预，根据总体信息综合评判来完成分类。有许多变量会影响XRD图案的形状，如材料的相或晶格，如果没有已知类似的结构，就很难表征材料。

Fig. 1 Crystal system and space group distributions.

此外，样品中存在的一些少量的杂质相可能会导致分类更加困难、耗时和不准确。超快同步X射线衍射和光谱学测量的最新进展在于从数百万次测量中产生了极大的数据集，远远超过了人类可以手动分析的数据量。

Fig. 2 Diffraction pattern comparison.

因此，对XRD数据进行自适应和自动分析目前存在迫切的需求。目前已开发的深度学习模型在不同数据集下的表现差异极大，表现为鲁棒性不足的特点。因此，我们需要一个更具有鲁棒性的模型，可以对不同材料的动态和/或看不见的真实XRD数据进行分类。

Fig. 3 Model Architectures.

来自罗切斯特大学机械工程学院的Niaz Abdolrahim教授小组，开发了一种用于晶体系统和空间群分类的广义深度学习模型。由于XRD数据中的相对峰强度、距离和顺序表征了对称性，研究人员研究了其中是否存在排列不变性和平移不变性，并据此提出了无池化的卷积神经网络（NPCNN），基于索引峰之间的相对和局部推理来表征材料，以此来完成分类工作。

Fig. 4 RRUFF Experimental Performance.

为了实现广泛的分类功能，作者还开发了一个数据生成流水线来建立高质量的数据集，该流水线结合了对衍射模式的实验效应，并且具有模拟经过合金化和/或动态实验的材料的能力。最后，研究人员成功使深度学习模型发挥出了最先进的性能。

Fig. 5 RRUFF and MP dataset Confusion Matrix. Confusion

该研究也为开发其他光谱表征技术模型提供了有效的研究思路。相关论文近期发布于npj Computational Materials v9: 214 (2023)。

Fig. 6 Materials project performance.

Editorial Summary

XRD data：An automated deep learning classifier

Determining the crystal structure of solid and liquid materials is important for understanding their mechanical, electromagnetic and thermodynamic properties. Powder X-ray diffraction (XRD) is an important means of material characterization, encoding information about crystal symmetry, lattice parameters, type, and filling of atoms on nanoscale domains.

Fig. 7 Lattice augmentation performance.

However, the current classification method requires a lot of human intervention to complete the classification based on comprehensive evaluation of the overall information. There are many variables that affect the shape of an XRD pattern, such as the phase or crystal lattice of the material. Without a known similar structure, it is difficult to characterize the material. In addition, the presence of some small amounts of impurity phases in the sample may make classification more difficult and time-consuming. and inaccuracies.

Fig. 8 F1 Score on RRUFF and MP datasets.

Recent advances in ultrafast synchronized XRD and spectroscopy measurements have generated extremely large data sets from millions of measurements, far exceeding what humans can manually analyze. Therefore, there is an urgent need for adaptive and automatic analysis of XRD data. The performance of currently developed deep learning models on different data sets varies greatly, showing insufficient robustness. A more robust model is needed that can classify dynamic and/or unseen real XRD data obtained from different materials.

Fig. 9 Scatterplot on MP performance.

A group led by Prof. Niaz Abdolrahim from the School of Mechanical Engineering, University of Rochester, developed a generalized deep learning model for crystal system and space group classification. Considering that the relative peak intensity, distance and order in XRD data indicate symmetry, the researchers investigated whether there is alignment invariance and translation invariance, and based on this, they proposed a no-pool convolutional neural network (NPCNN). Classification was accomplished by characterizing materials based on relative and local inferences between indexed peaks. To enable extensive classification capabilities, the authors also developed a data generation pipeline to build high-quality data sets that incorporates experimental effects on diffraction patterns. The pipeline also has the capability of simulating materials that undergo alloying and/or dynamic experimentation. The researchers succeeded in making the deep learning model achieve state-of-the-art performance. This study provides a valuable platform for developing models of other spectral characterization techniques. This article was recently published in npj Computational Materials v9: 214 (2023).

Fig. 10 Model architecture taxonomy.

原文Abstract及其翻译

Automated classification of big X-ray diffraction data using deep learning models (使用深度学习模型对大X射线衍射数据进行自动分类)

Jerardo E. Salgado, Samuel Lerman, Zhaotong Du, Chenliang Xu & Niaz Abdolrahim

Abstract In current in situ X-ray diffraction (XRD) techniques, data generation surpasses human analytical capabilities, potentially leading to the loss of insights. Automated techniques require human intervention, and lack the performance and adaptability required for material exploration. Given the critical need for high-throughput automated XRD pattern analysis, we present a generalized deep learning model to classify a diverse set of materials’ crystal systems and space groups. In our approach, we generate training data with a holistic representation of patterns that emerge from varying experimental conditions and crystal properties. We also employ an expedited learning technique to refine our model’s expertise to experimental conditions. In addition, we optimize model architecture to elicit classification based on Bragg’s Law and use evaluation data to interpret our model’s decision-making. We evaluate our models using experimental data, materials unseen in training, and altered cubic crystals, where we observe state-of-the-art performance and even greater advances in space group classification.

摘要在当前的原位X射线衍射（XRD）技术中，数据的生成能力超过了人类的分析能力，有可能导致洞察力的损失。自动化技术需要人工干预，并且缺乏材料研究所需的性能和适应性。鉴于高通量自动XRD模式分析的迫切需求，我们提出了一个广义的深度学习模型来分类不同材料的晶体系统和空间群。在我们的方法中，我们利用来自不同实验条件和晶体性质模式的整体表示来生成训练数据。我们还采用了一种快速学习技术来改进我们模型在实验条件下的专长。此外，我们优化了模型架构，以引出基于布拉格定律的分类，并使用评估数据来解释我们模型的决策。我们使用实验数据、训练中未见的材料以及改变了的立方晶体，来评估模型，我们观察到最先进的性能和空间群分类的更大进步。

原创文章，作者：计算搬砖工程师，如若转载，请注明来源华算科技，注明出处：https://www.v-suan.com/index.php/2024/01/28/fbb3609813/

深度学习自动分类器：XRD数据利用

相关推荐

分享到：