Bolek: A Multimodal Language Model for Molecular Reasoning

· Source: Machine Learning · Field: Science & Research — Artificial Intelligence & Machine Learning, Life Sciences & Biology, Health & Medical Research · Depth: Expert, quick

Summary

Bolek is a compact multimodal language model designed for molecular reasoning, addressing the auditability challenges of existing drug-discovery models. It integrates a Morgan fingerprint embedding into an instruction-tuned text decoder, grounding natural-language reasoning in molecular structure. Fine-tuned on molecular alignment tasks like description, RDKit descriptor prediction, and substructure detection, Bolek also performs downstream reasoning on 15 TDC binary classification tasks using synthetic chains-of-thought. The model significantly outperforms its Qwen3-4B-Instruct base, increasing mean ROC/PR AUC from 0.55 to 0.76, and surpasses TxGemma-9B-Chat on most classification tasks despite being smaller. Bolek's explanations are more grounded, citing numerical descriptors 10-100 times more frequently with strong agreement to RDKit values for key descriptors.

Key takeaway

For AI engineers developing drug-discovery models, Bolek demonstrates that integrating molecular structure embeddings directly into language models can yield more auditable and accurate reasoning. You should consider multimodal input and synthetic, feature-anchored chains-of-thought to improve model performance and the verifiability of generated explanations, especially when working with compact models for high-stakes applications.

Key insights

Targeted multimodal injection and reasoning supervision create auditable, compact molecular reasoning models.

Principles

Method

Bolek injects Morgan fingerprint embeddings into a Qwen3-4B-Instruct text decoder, then fine-tunes on molecular alignment and 15 TDC binary classification tasks using synthetic chains-of-thought anchored in molecular features.

In practice

Topics

Best for: AI Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.