Search results:

Scientific Blind Spots Introduced by Feature Engineering in Materials Informatics

Feature engineering remains central to materials informatics, yet it systematically introduces scientific blind spots that constrain discovery and interpretation. These blind spots arise from choices in descriptor selection, transformation, and dimensionality reduction that inadvertently prioritize statistical correlations over physical invariance, overlook multi-scale interactions, and embed dataset-specific biases into model architectures. In the small-data regimes common to materials science, engineered features often amplify overfitting while diminishing generalizability across chemical spaces. Interpretability suffers as complex engineered descriptors obscure mechanistic linkages between atomic structure and macroscopic properties. The literature consistently highlights these limitations across perovskites, alloys, energy materials, and porous systems, underscoring the tension between predictive performance and scientific fidelity. This conceptual manuscript synthesizes these challenges and proposes an original Integrated Blind Spot Navigation Model (IBSNM). The framework organizes feature engineering around four interdependent pillars (physical consistency guardrails, multi-scale descriptor integration, uncertainty-aware selection, and iterative co-interpretation) linked by feedback mechanisms that surface and mitigate hidden assumptions. By reframing feature engineering as a navigable landscape rather than a static preprocessing step, the model offers a conceptual pathway, presented without empirical validation, toward more robust and transparent materials informatics practices.

Journal of Artificial Intelligence for Materials Science

Original Research | Open access | 18 January 2023 | Article: 22

Feature Engineering as Scientific Framing: Encoding Choices in Materials Informatics

The advent of computational and data-driven materials engineering has transformed materials discovery by integrating machine learning with high-throughput simulations and experimental workflows. Within this ecosystem, feature engineering emerges not merely as a technical preprocessing step but as a fundamental scientific framing mechanism that encodes domain knowledge into data representations, influencing inference pathways and discovery outcomes. This conceptual manuscript explores how encoding choices in materials informatics shape epistemic structures, steering computational pipelines from raw multimodal datasets to inverse design strategies. We identify a conceptual gap in current paradigms: representation decisions often remain implicit, leading to unexamined trade-offs in uncertainty propagation and model interpretability. To address this, we introduce the Encoding Dynamics Framework (EDF), a systems-level architecture that conceptualizes feature engineering as an interactive layer between data infrastructures and AI-guided discovery systems. EDF highlights feedback loops in which encoding selections modulate representation learning, graph neural networks, and closed-loop experimentation, fostering more robust computational steering logics. Implications extend to foundation models for materials science, simulation-experiment coupling, and uncertainty quantification, promoting infrastructures that align encoding with the goals of scientific inquiry. By reframing feature engineering as epistemic framing, this work offers interpretive insights, without empirical validation, into how data encoding choices drive materials innovation.

Journal of Computational and Data-Driven Materials Engineering

Original Research | Open access | 18 March 2023 | Article: 99
