๐Ÿ† SpectrumLab Leaderboard

A comprehensive benchmark for evaluating large language models on spectroscopic analysis tasks.

๐Ÿ“Š Evaluation Levels: Signal Processing, Perception, Semantic Understanding, Generation
๐Ÿ”ฌ Domains: IR, NMR, UV-Vis, Mass Spectrometry and more
๐ŸŒŸ Multimodal: Support for both text-only and vision-language models

๐Ÿ“ˆ Stats: 18 models evaluated
๐Ÿ… Rankings: ๐Ÿฅ‡๐Ÿฅˆ๐Ÿฅ‰ medals for top performers
๐Ÿ”— Submit: Send evaluation results to contribute your model!

๐Ÿท๏ธ Model Type
๐Ÿ‘๏ธ Modality
๐Ÿ“Š Sort By

๐Ÿ† Model Rankings

Select Model for Details

๐Ÿ“Š Subcategory Results

๐Ÿ” Column Explanations

  • Rank: ๐Ÿฅ‡ 1st place, ๐Ÿฅˆ 2nd place, ๐Ÿฅ‰ 3rd place, then numbers
  • Type: ๐Ÿ”“ Open Source, ๐Ÿ”’ Proprietary, ๐Ÿ“Š Baseline
  • MM: ๐Ÿ‘๏ธ Multimodal, ๐Ÿ“ Text-only
  • Overall: Average accuracy across all evaluated levels
  • Signal: Low-level signal processing tasks
  • Perception: Mid-level feature extraction tasks
  • Semantic: High-level understanding tasks
  • Generation: Spectrum generation tasks

๐Ÿ“ Notes

  • "-" indicates the model was not evaluated on that benchmark
  • Rankings are based on overall performance across all evaluated tasks
  • Multimodal models can process both text and spectroscopic images
  • Click on model names and submitters to visit their pages

๐Ÿ“Š Task Categories

Signal Level:

  • Spectrum Type Classification (TC)
  • Spectrum Quality Assessment (QE)
  • Basic Feature Extraction (FE)
  • Impurity Peak Detection (ID)

Perception Level:

  • Functional Group Recognition (GR)
  • Elemental Compositional Prediction (EP)
  • Peak Assignment (PA)
  • Basic Property Prediction (PP)

Semantic Level:

  • Molecular Structure Elucidation (SE)
  • Fusing Spectroscopic Modalities (FM)
  • Multimodal Molecular Reasoning (MR)

Generation Level:

  • Forward Problems (FP)
  • Inverse Problems (IP)
  • De Novo Generation (DnG)