Search results:

Benchmarking Without Reality: Dataset Construction Bias in Materials Evaluation

In the rapidly evolving field of computational and data-driven materials engineering, machine learning models are increasingly deployed for property prediction, inverse design, and autonomous discovery. However, the integrity of these models hinges on the quality of training datasets, which often embed subtle biases arising from construction methodologies. This manuscript explores the conceptual underpinnings of dataset construction bias in materials AI evaluation, framing it as an epistemic challenge that distorts benchmarking outcomes and impedes genuine materials discovery. We introduce the Dataset Integrity Cascade (DIC) framework, a layered conceptual model that maps data curation processes to inference distortions, incorporating feedback mechanisms to reveal how biases propagate through representation learning, model training, and validation pipelines. By synthesizing recent advances in materials informatics, graph neural networks, and uncertainty quantification, the framework highlights systemic trade-offs between dataset scale and representational fidelity. Implications extend to high-throughput computation, closed-loop experimentation, and foundation models for science, suggesting pathways for more robust computational steering in materials design. This work underscores the need for integrative approaches that align dataset architectures with the inherent complexities of materials systems, fostering epistemically sound innovation without empirical validation.

Journal of Computational and Data-Driven Materials Engineering

Original Research | Open access | 18 March 2023 | Article: 95

Filters

Clear All

Subject

AI-Assisted Materials Synthesis Advanced Functional Materials Artificial Intelligence in Materials Science Computational Materials Engineering Computational Materials Science Data-Driven Materials Design Digital Materials Engineering Digital Twin for Materials Systems High-Throughput Materials Screening Integrated Computational Materials Engineering (ICME) Machine Learning for Materials Discovery Materials Characterization and Analysis Materials Characterization and Data Analysis Materials Data Analytics Materials Informatics Materials Modeling and Simulation Materials Optimization Multiscale Materials Modeling Nanomaterials Predictive Modeling of Material Properties Smart Materials Sustainable Materials Design Sustainable Materials Development

Journal

Journal of Artificial Intelligence for Materials Science Journal of Computational and Data-Driven Materials Engineering

Year

2026 2025 2024 2023 2022 2021 2020 2019 2018 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2004 2003 2002 2001 2000

Article type

Original Research Review Systematic Review Mini Review Meta-Analysis Case Report Case Study Clinical Trial Methods Methodology Article Data Report Dataset Paper Perspective Opinion Editorial Letter to the Editor Commentary General Commentary Policy and Practice Review Policy Brief Educational Material Hypothesis and Theory Short Communication Technical Report Research Report Cross-Sectional Study Cohort Study Case-Control Study Classification Correction Erratum Retraction Replication Study Philosophical Analysis Protocol Registered Report Brief Report Conference Paper Book Review Article

Access type

Open access