From Natural Language to Executable Narsese: A Neuro-Symbolic Benchmark and Pipeline for Reasoning with NARS
Summary
A new neuro-symbolic framework and benchmark, NARS-Reasoning-v0.1, has been introduced to address the unreliability of Large Language Models (LLMs) in reasoning tasks requiring explicit symbolic structure and multi-step inference. This benchmark, comprising 1,000 natural-language reasoning instances, pairs problems with First-Order Logic (FOL) forms, executable Narsese programs, and three gold labels: True, False, and Uncertain. A deterministic compiler translates FOL to Narsese, and an execution-based validation procedure in OpenNARS for Applications (ONA) ensures symbolic targets are syntactically well-formed and behaviorally aligned. The framework also defines Language-Structured Perception (LSP), where LLMs are trained to produce reasoning-relevant symbolic structure. As a proof of concept, a Phi-2 LoRA adapter was trained on NARS-Reasoning-v0.1 for three-label classification, demonstrating the benchmark's utility for supervised adaptation in addition to executable evaluation.
Key takeaway
For NLP Engineers and Research Scientists developing robust reasoning systems, this work highlights the importance of moving beyond verbal answers to executable symbolic representations. You should consider integrating neuro-symbolic pipelines that translate natural language into formal logic like Narsese, validated through execution in systems like ONA. This approach ensures that your models produce not just plausible text, but verifiable and semantically correct reasoning structures, crucial for reliable AI applications.
Key insights
Neuro-symbolic systems can enhance LLM reasoning by translating natural language into executable symbolic structures for validation.
Principles
- Reasoning requires explicit symbolic structure.
- Execution-based validation ensures semantic correctness.
- Uncertainty is a critical reasoning outcome.
Method
Translate natural language to FOL, compile FOL to executable Narsese, and then execute in ONA to determine reasoning outcomes (True, False, Uncertain), validating symbolic programs behaviorally.
In practice
- Use NARS-Reasoning-v0.1 for neuro-symbolic evaluation.
- Train LLMs to generate executable symbolic programs.
- Employ ONA for runtime validation of Narsese programs.
Topics
- Neuro-Symbolic Reasoning
- NARS-Reasoning-v0.1 Benchmark
- First-Order Logic
- Executable Narsese
- OpenNARS for Applications
Best for: NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.