A curated list of awesome AI tools, libraries, papers, datasets, and frameworks that accelerate scientific discovery — from physics and chemistry to biology, materials, and beyond.
-
Updated
Nov 30, 2025
A curated list of awesome AI tools, libraries, papers, datasets, and frameworks that accelerate scientific discovery — from physics and chemistry to biology, materials, and beyond.
Democratizing AI scientists with ToolUniverse
Scientific machine learning (SciML) benchmarks, AI for science, and (differential) equation solvers. Covers Julia, Python (PyTorch, Jax), MATLAB, R
A Collection of Awesome Large Weather Models (LWMs) | AI for Earth (AI4Earth) | AI for Science (AI4Science)
The official implementation of 3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction (ICLR 2023)
[ICLR 2024] EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations
❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents
[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models
[ICLR 2023 Spotlight] Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs
Automated Hypothesis Testing with Agentic Sequential Falsifications
[updating] Chinese Medical Dataset 致力于详细整理所有现有中文医学数据集,包括详细的数据汇总、数据示例、下载链接等。
[NeurIPS'24] Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling
[Briefings In Bioinformatics] DrugAssist: A Large Language Model for Molecule Optimization
Learning the language of protein-protein interactions
Generative Pre-trained Graph Eulerian Transformer [ICML2025]
Official PyTorch implementation of PSE/PSRN: Fast and efficient symbolic expression discovery through parallelized symbolic enumeration. Evaluates millions of expressions simultaneously on GPU with automated subtree reuse.
PaddleMaterials is a data-mechanism dual-driven, foundation model development and deployment, end to end toolkit based on PaddlePaddle deep learning framework for materials science and engineering.
[ICML 2025] HypotheSAEs: Hypothesizing interpretable relationships in text datasets using sparse autoencoders. https://arxiv.org/abs/2502.04382
Generating and validating natural-language explanations for the brain.
Add a description, image, and links to the ai-for-science topic page so that developers can more easily learn about it.
To associate your repository with the ai-for-science topic, visit your repo's landing page and select "manage topics."