SciDER: Scientific Data-centric End-to-end Researcher

An autonomous, multimodal system that parses raw experimental data, generates hypotheses via Evolutionary Idea Search, and synthesizes executable code across scientific domains.

William & Mary · MBZUAI · University of Minnesota Ke Lin · Owais Aijaz · Yilin Lu · Xuehang Guo · Preslav Nakov

Authors

Ke Lin · William & Mary
Owais Aijaz · MBZUAI
Yilin Lu · University of Minnesota
Yiyang Luo · HKUST
Xuehang Guo · William & Mary
Preslav Nakov · MBZUAI

Contact

leonard.keilin@gmail.com
owais.aijaz@mbzuai.ac.ae
lu000661@umn.edu
yluodq@connect.ust.hk
xguo15@wm.edu
preslav.nakov@mbzuai.ac.ae

Abstract

While large language models accelerate scientific discovery, existing agents struggle to autonomously process raw, multimodal, and domain-specific experimental data. We introduce SciDER, an autonomous system that automates the research lifecycle. Equipped with a dynamic multimodal skill system, SciDER uses specialized agents to generate hypotheses via Evolutionary Idea Search, analyze raw data, and synthesize executable code grounded in data characteristics. These processes are refined iteratively by a critic-led feedback loop. To support open-source research, we release OpenSciDER-SFT-8K, a curated trajectory dataset, and the OpenSciDER-27B fine-tuned model. Comprehensive evaluations across ideation, multimodal data analysis, and coding benchmarks show that SciDER outperforms state-of-the-art agents, bridging the gap between scientific reasoning and experimental synthesis.

System Highlights

Multimodal skill system. Dynamically loads domain-specific skills across disciplines.
Evolutionary Idea Search. Ranks and evolves candidate hypotheses via an LLM judge.
Data-centric analysis. Profiles raw data by structure, quality, semantics, and dependency.
Critic-led feedback. Iteratively refines ideation, analysis, and experiments.

Release Snapshot

Open-source codebase on GitHub (LangGraph + litellm, Apache-2.0).
OpenSciDER-SFT-8K dataset of 8,532 curated research trajectories.
OpenSciDER-27B fine-tuned model built on the Qwen3.6-27B backbone.
PyPI package and lightweight web interface for end-to-end runs.

Agent Comparison

Comparison figure between general agents and SciDER

Demo Walkthrough

  1. Upload raw experimental data or connect lab storage.
  2. SciDER parses observations, protocols, and metadata.
  3. Hypothesis generator proposes next-step experiments.
  4. Critic validates feasibility and novelty.

Demo Links

Paper: arxiv.org/abs/2603.01421
Live demo: huggingface.co/spaces/AI4Research/scider
Project page: leonardodalinky.github.io/scider-proj-page/
SciDER workflow selection panel
Web interface of the SciDER Research Assistant showing the workflow selection panel. Users can initiate different research workflows—Ideation, Data Analysis, Experiment, or a Full Workflow—through a set of interactive buttons, enabling the system to support various stages of the scientific research process from idea generation to experimental execution.
SciDER case study selection
Web interface of the SciDER Research Assistant demonstrating the selection of a case study for a full research workflow. In this example, the system loads a Kepler Exoplanet dataset task, where the agent autonomously conducts ideation, feature hypothesis generation, model training, and experimental evaluation to detect exoplanet transit signals from stellar light curves.

Resources

Paper

Our preprint details the multimodal skill system, Evolutionary Idea Search, and critic-led feedback loop, with evaluations across ideation, data analysis, and coding benchmarks, plus the OpenSciDER dataset and model.

📄 arXiv:2603.01421

Open Questions

We are actively addressing error cascading in long-horizon multi-agent workflows and closing the performance gap between OpenSciDER-27B and frontier proprietary models. Collaborators are welcome.