Search results:

Compositional Generalization as a Distinct Failure Mode in Materials AI

The integration of artificial intelligence into materials science has highlighted challenges in model performance, particularly in domains that require extrapolation beyond the training data distribution. This manuscript explores compositional generalization as a unique failure mode in materials AI, in which systems struggle to interpret novel combinations of atomic or molecular elements despite familiarity with individual components. Through a synthesis of recent literature, the analysis delineates how this failure manifests in predictive tasks, such as property estimation in alloys or polymers, revealing underlying tensions between data-driven learning and structural comprehension. Conceptual interpretations highlight the interplay between representational invariance and contextual dependencies, underscoring epistemic gaps in current architectures. The proposed framework interprets these dynamics through lenses of modular interaction and systemic feedback, emphasizing trade-offs in scalability and robustness. By examining the ethical ramifications of deployment in high-stakes applications, the discussion integrates insights into steering mechanisms that could mitigate such limitations without empirical validation. Ultimately, this conceptual inquiry fosters a deeper understanding of AI’s role in advancing materials discovery and advocates for interpretive strategies that prioritize holistic integration over isolated optimizations.

Journal of Artificial Intelligence for Materials Science

Original Research | Open access | 18 July 2022 | Article: 9

Temporal Generalization in Materials AI — What We Know About Model Aging and Drift

Artificial intelligence is rapidly reshaping materials science by accelerating property prediction, synthesis planning, and materials design. Yet most AI models for materials are developed and validated under implicit stationary assumptions, while real deployments unfold in time-varying environments where materials, sensors, and processes evolve. This review synthesizes what is currently known about temporal generalization in materials AI—the capacity of models to remain reliable as data distributions and underlying mechanisms change. We distinguish two dominant degradation pathways: drift, in which input statistics or input–output relationships shift over time, and model aging, in which learned representations become obsolete as systems evolve. Drawing on evidence across biosensing and wearables, electrochemical energy storage, polymer synthesis, automated laboratories, and industrial manufacturing, we summarize how temporal failures arise, how they are detected, and why they often remain silent until performance drops become consequential. We then evaluate mitigation strategies—including domain adaptation, incremental and continual learning, active data acquisition, uncertainty-aware prediction, and human–AI feedback loops—highlighting where they succeed, where they break down, and the constraints that limit their scalability in real-world settings. Finally, we identify key gaps: limited longitudinal datasets, weak standardization of temporal evaluation protocols, underexplored multimodal temporal fusion, and insufficient emphasis on prevention rather than detection. We conclude with a forward agenda for resilient materials AI built around lifecycle monitoring, benchmarkable temporal stress tests, and hybrid frameworks that integrate mechanistic knowledge with adaptive learning to sustain reliability over time.

Journal of Artificial Intelligence for Materials Science

Review | Open access | 18 July 2023 | Article: 31

Generalization in Materials AI: A Theoretical Distinction between New Compositions, New Structures, and New Physics

The integration of artificial intelligence into materials science has accelerated property prediction and high-throughput screening. Yet, the field’s progress hinges on models’ ability to generalize beyond their training distributions. Existing literature often addresses generalization in broad terms, focusing on out-of-distribution performance or extrapolation without distinguishing the qualitative nature of material novelty. This conceptual manuscript introduces a novel theoretical framework for categorizing generalization in materials AI into three distinct levels: new compositions (variations within known structural families), new structures (alternative atomic arrangements or topologies), and new physics (emergence of phenomena governed by mechanisms absent from the training data). Drawing on recent advances in graph neural networks, scalable deep learning, and materials representations, we synthesize evidence that current models achieve reasonable interpolation within familiar domains but encounter progressively greater difficulties across these levels. The proposed distinction provides a structured lens for analyzing model limitations, interpreting benchmark results, and guiding the design of future architectures and training strategies. By formalizing these categories, the framework aims to advance theoretical understanding of generalization in materials AI, emphasizing the need for targeted approaches at each level to enable reliable discovery of novel materials.

Journal of Artificial Intelligence for Materials Science

Original Research | Open access | 18 July 2024 | Article: 55

Conceptual Foundations for Adversarial Validation in Materials Machine Learning

Standard validation protocols in materials machine learning continue to rely on the assumption that training and test data are drawn from the same underlying distribution. This assumption is almost invariably violated in real-world materials datasets because of temporal drift in measurement techniques, compositional biases in database construction, and experimental confounders arising from different laboratories and instruments. This conceptual framework article proposes adversarial validation as a diagnostic tool specifically tailored for materials informatics: a method that trains a discriminator to explicitly detect whether a distribution shift exists between any two datasets, thereby revealing hidden generalization failures that conventional train-test splits and k-fold cross-validation cannot expose. The framework introduces the conceptual foundations of adversarial validation, distinguishes it from adversarial attacks, articulates why the technique is particularly powerful in the small-data, high-dimensional, and physically constrained domain of materials science, and offers a five-component structure for its systematic application—feature-space definition, classifier selection, shift-detection thresholding, localization of driving features, and actionable response rules. By embedding materials-specific domain knowledge into the interpretation of discriminator performance, the approach transforms validation from a passive checkpoint into an active diagnostic that can distinguish temporal shift from compositional bias and experimental confounding. The implications for materials AI practice are immediate and transformative: researchers can now report adversarial validation results alongside standard metrics, trigger targeted dataset augmentation or model retraining when shifts are detected, and document potential sources of distribution mismatch in experimental workflows, ultimately raising the robustness and trustworthiness of property predictions that underpin materials discovery and design.

Journal of Artificial Intelligence for Materials Science

Original Research | Open access | 18 January 2022 | Article: 99

A Conceptual Distinction between Generalization and Transfer in Materials Machine Learning

In the rapidly expanding domain of artificial intelligence applied to materials science, a persistent conceptual ambiguity undermines the reliability of reported model capabilities. The terms “generalization” and “transfer” are routinely conflated, with authors claiming that a model “generalizes” when it is in fact being evaluated on samples drawn from a distinctly different distribution. This boundary/definitional paper draws a sharp conceptual distinction between the two notions. Generalization is defined as the expected performance of a trained model on new samples drawn independently and identically from the same underlying distribution as the training data. In contrast, transfer is defined as performance on samples drawn from a different distribution, where the I.I.D. assumption is violated by construction. The distinction matters because a model that generalizes excellently within its training distribution can fail dramatically under transfer conditions, and conversely, a successful transfer mechanism may mask poor generalization; treating the two interchangeably, therefore, produces overclaims about model robustness that cannot be sustained when materials discovery moves beyond the convex hull of available training data. The paper articulates a two-dimensional boundary framework—distribution-shift magnitude and feature-space overlap—that locates any given evaluation setting along a continuum from pure generalization to pure transfer, thereby enabling authors, reviewers, and practitioners to specify precisely which capability is being claimed and tested. By clarifying these boundaries and exposing the epistemic costs of current usage, the work supplies a conceptual foundation for more disciplined reporting standards and evaluation protocols in materials machine learning.

Journal of Artificial Intelligence for Materials Science

Original Research | Open access | 18 January 2023 | Article: 108

Filters

Clear All

Subject

AI-Assisted Materials Synthesis Advanced Functional Materials Artificial Intelligence in Materials Science Computational Materials Engineering Computational Materials Science Data-Driven Materials Design Digital Materials Engineering Digital Twin for Materials Systems High-Throughput Materials Screening Integrated Computational Materials Engineering (ICME) Machine Learning for Materials Discovery Materials Characterization and Analysis Materials Characterization and Data Analysis Materials Data Analytics Materials Informatics Materials Modeling and Simulation Materials Optimization Multiscale Materials Modeling Nanomaterials Predictive Modeling of Material Properties Smart Materials Sustainable Materials Design Sustainable Materials Development

Journal

Journal of Artificial Intelligence for Materials Science Journal of Computational and Data-Driven Materials Engineering

Year

2026 2025 2024 2023 2022 2021 2020 2019 2018 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2004 2003 2002 2001 2000

Article type

Original Research Review Systematic Review Mini Review Meta-Analysis Case Report Case Study Clinical Trial Methods Methodology Article Data Report Dataset Paper Perspective Opinion Editorial Letter to the Editor Commentary General Commentary Policy and Practice Review Policy Brief Educational Material Hypothesis and Theory Short Communication Technical Report Research Report Cross-Sectional Study Cohort Study Case-Control Study Classification Correction Erratum Retraction Replication Study Philosophical Analysis Protocol Registered Report Brief Report Conference Paper Book Review Article

Access type

Open access