From Natural Language to Executable Narsese: A Neuro-Symbolic Benchmark and Pipeline for Reasoning with NARS

2026-04-22 · Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, extended

Summary

A new neuro-symbolic framework and benchmark, NARS-Reasoning-v0.1, has been introduced to address the unreliability of Large Language Models (LLMs) in reasoning tasks requiring explicit symbolic structure and multi-step inference. This benchmark, comprising 1,000 natural-language reasoning instances, pairs problems with First-Order Logic (FOL) forms, executable Narsese programs, and three gold labels: True, False, and Uncertain. A deterministic compiler translates FOL to Narsese, and an execution-based validation procedure in OpenNARS for Applications (ONA) ensures symbolic targets are syntactically well-formed and behaviorally aligned. The framework also defines Language-Structured Perception (LSP), where LLMs are trained to produce reasoning-relevant symbolic structure. As a proof of concept, a Phi-2 LoRA adapter was trained on NARS-Reasoning-v0.1 for three-label classification, demonstrating the benchmark's utility for supervised adaptation in addition to executable evaluation.

Key takeaway

For NLP Engineers and Research Scientists developing robust reasoning systems, this work highlights the importance of moving beyond verbal answers to executable symbolic representations. You should consider integrating neuro-symbolic pipelines that translate natural language into formal logic like Narsese, validated through execution in systems like ONA. This approach ensures that your models produce not just plausible text, but verifiable and semantically correct reasoning structures, crucial for reliable AI applications.

Key insights

Neuro-symbolic systems can enhance LLM reasoning by translating natural language into executable symbolic structures for validation.

Principles

Reasoning requires explicit symbolic structure.
Execution-based validation ensures semantic correctness.
Uncertainty is a critical reasoning outcome.

Method

Translate natural language to FOL, compile FOL to executable Narsese, and then execute in ONA to determine reasoning outcomes (True, False, Uncertain), validating symbolic programs behaviorally.

In practice

Use NARS-Reasoning-v0.1 for neuro-symbolic evaluation.
Train LLMs to generate executable symbolic programs.
Employ ONA for runtime validation of Narsese programs.

Topics

Neuro-Symbolic Reasoning
NARS-Reasoning-v0.1 Benchmark
First-Order Logic
Executable Narsese
OpenNARS for Applications

Best for: NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.