Stanford AI Lab Papers and Talks at ICLR 2022

· Source: The Stanford AI Lab Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Data Science & Analytics · Depth: Advanced, medium

Summary

Stanford AI Lab (SAIL) researchers presented 19 papers at the International Conference on Learning Representations (ICLR) 2022, held virtually from April 25th to April 29th. Key contributions include "Autonomous Reinforcement Learning: Formalism and Benchmarking," which introduces a new paradigm for continual learning, and "MetaShift," a dataset for evaluating contextual distribution shifts. Other notable works explore in-context learning in large language models like GPT-3, propose GreaseLM for knowledge graph-enhanced question answering, and investigate efficient model editing techniques. Robotics research is represented by papers on vision-based manipulators and learning inter-object functional relationships in 3D scenes. Additionally, research covers sparse training for neural networks, efficient long sequence modeling with structured state spaces, and methods for assessing machine learning API shifts.

Key takeaway

For NLP engineers and research scientists working with large language models, understanding that in-context learning arises from implicit Bayesian inference on long-range coherent pre-training data is crucial. You should prioritize pre-training distributions that foster this coherence and be mindful of how prompt formatting and example ordering can impact few-shot performance, even leading to zero-shot outperforming one-shot in some cases.

Key insights

In-context learning in large language models can emerge from modeling long-range coherence in pre-training data.

Principles

Method

A proposed pre-training distribution, where documents are conditioned on a latent concept (like an HMM's hidden state transition matrix), enables in-context learning to emerge.

In practice

Topics

Code references

Best for: NLP Engineer, Research Scientist, AI Researcher, AI Scientist, AI Student

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Stanford AI Lab Blog.