๐ SpectrumLab Leaderboard
A comprehensive benchmark for evaluating large language models on spectroscopic analysis tasks.
๐ Evaluation Levels: Signal Processing, Perception, Semantic Understanding, Generation
๐ฌ Domains: IR, NMR, UV-Vis, Mass Spectrometry and more
๐ Multimodal: Support for both text-only and vision-language models
๐ Stats: 18 models evaluated
๐
Rankings: ๐ฅ๐ฅ๐ฅ medals for top performers
๐ Submit: Send evaluation results to contribute your model!
๐ท๏ธ Model Type
๐๏ธ Modality
๐ Sort By
๐ Model Rankings
๐ฅ | ๐ | Unknown | ๐๏ธ | 61.5 | 75.8 | 79.9 | 81.9 | 25.6 | 2025-08-01 |
Select Model for Details
๐ Subcategory Results
๐ Column Explanations
- Rank: ๐ฅ 1st place, ๐ฅ 2nd place, ๐ฅ 3rd place, then numbers
- Type: ๐ Open Source, ๐ Proprietary, ๐ Baseline
- MM: ๐๏ธ Multimodal, ๐ Text-only
- Overall: Average accuracy across all evaluated levels
- Signal: Low-level signal processing tasks
- Perception: Mid-level feature extraction tasks
- Semantic: High-level understanding tasks
- Generation: Spectrum generation tasks
๐ Notes
- "-" indicates the model was not evaluated on that benchmark
- Rankings are based on overall performance across all evaluated tasks
- Multimodal models can process both text and spectroscopic images
- Click on model names and submitters to visit their pages
๐ Task Categories
Signal Level:
- Spectrum Type Classification (TC)
- Spectrum Quality Assessment (QE)
- Basic Feature Extraction (FE)
- Impurity Peak Detection (ID)
Perception Level:
- Functional Group Recognition (GR)
- Elemental Compositional Prediction (EP)
- Peak Assignment (PA)
- Basic Property Prediction (PP)
Semantic Level:
- Molecular Structure Elucidation (SE)
- Fusing Spectroscopic Modalities (FM)
- Multimodal Molecular Reasoning (MR)
Generation Level:
- Forward Problems (FP)
- Inverse Problems (IP)
- De Novo Generation (DnG)