Source Trail &amp; Reference Library

Research from the creators of GPT and DALL-E

llmgptsafetyalignment

DeepMind

weekly / credibility 5/5

Google's AI research lab - AlphaGo, AlphaFold, Gemini

rlalphafoldneuroscience

Meta AI (FAIR)

weekly / credibility 5/5

Meta's Fundamental AI Research - PyTorch, LLaMA models

cvnlpllamapytorch

Anthropic

monthly / credibility 5/5

claudeconstitutional-aiinterpretability

AI safety company - Constitutional AI, Claude models

Google AI Blog

weekly / credibility 5/5

Current researchuse-for-benchmark-context-verify-primary-paper

Google's AI research updates and applied ML insights

appliedresearchproducts

Discovery & Tooling

Papers with Code

daily / credibility 5/5

Papers + code implementations, SOTA benchmarks

benchmarkssotaimplementations

Hugging Face

daily / credibility 5/5

Current researchuse-for-model-and-dataset-discovery-verify-primary-source

Model hub, datasets, and Transformers library

modelsdatasetstransformers

AI Alignment Forum

daily / credibility 4/5

Needs reviewuse-for-safety-research-discovery-human-review-required

Community forum for AI safety research discussion

safetyalignmentrisks

Analysis & Newsletters

The Batch (DeepLearning.AI)

weekly / credibility 5/5

Needs reviewuse-for-discovery-not-final-citation

Andrew Ng's weekly AI newsletter - accessible explanations

newsletteraccessiblecurated

Import AI

weekly / credibility 4/5

Needs reviewuse-for-discovery-not-final-citation

Jack Clark's newsletter on AI policy and research

newsletteranalysispolicy

Pulled Research Leads

Recent cached paper pulls, grouped as research leads until reviewed.

Open live feed

Current researchAutomated pullcs.AI-2026-05-22.json

LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent Systems

Large language model (LLM)-based multi-agent systems increasingly rely on intermediate communication to coordinate complex tasks. While most existing systems communicate through natural language, recent work shows that latent communication, particularly through transformer key-value (KV) caches, can improve efficiency and preserve richer task-relevant information. However, KV caches also encode contextual inputs, intermediate reasoning states, and agent-specific information, creating an opaque channel through which sensitive content may propagate across agents without explicit textual disclosure. To address this, we introduce \textbf{LCGuard} (Latent Communication Guard), a framework for safe KV-based latent communication in multi-agent LLM systems. LCGuard treats shared KV caches as latent working memory and learns representation-level transformations before cache artifacts are transmitted across agents. We formalize representation-level sensitive information leakage operationally through reconstruction: a shared cache artifact is unsafe if an adversarial decoder can recover agent-specific sensitive inputs from it. This leads to an adversarial training formulation in which the adversary learns to reconstruct sensitive inputs, while LCGuard learns transformations that preserve task-relevant semantics and reduce reconstructable information. Empirical evaluations across multiple model families and multi-agent benchmarks show that LCGuard consistently reduces reconstruction-based leakage and attack success rates while maintaining competitive task performance compared to standard KV-sharing baselines.

5/21/2026/Sadia Asif, Mohammad Mohammadi Amiri, Momin Abbas et al./ 32% relevance

Current researchAutomated pullcs.AI-2026-05-22.json

Towards a General Intelligence and Interface for Wearable Health Data

While ubiquitous wearable sensors capture a wealth of behavioral and physiological information, effectively transforming these signals into personalized health insights is challenging. Specifically, converting low-level sensor data into representations capable of characterizing higher-level states is difficult due to high phenotypic diversity and variation in individual baseline health, physiology, and lifestyle factors. Moreover, collecting wearable data paired with health outcome annotations is laborious and expensive, and retrospective annotation remains practically unfeasible, contributing to a scarcity of data with high-quality labels. To overcome these limitations, we propose a foundation model for wearable health that is pretrained on more than one trillion minutes of unlabeled sensor signals drawn from a large cohort of five million participants. We demonstrate that the joint scaling of model capacity and pretraining data volume leads to systematic improvements in performance, as evaluated on a diverse set of 35 health prediction tasks, spanning cardiovascular, metabolic, sleep, and mental health, as well as lifestyle choices and demographic factors. We find that this population scale representation unlocks label-efficient few-shot learning and generative capabilities for robust daily metric estimation. To further leverage this learned representation, we deploy a classroom of LLM agents to autonomously search the space of downstream predictive heads built on the model embeddings, showing broad performance improvements that increase with LLM model capacity. Finally, we show how integrating these downstream predictors into a Personal Health Agent can support model responses that are more relevant, contextually aware, and safe, and we validate this via 1,860 ratings from a cohort of clinicians.

5/21/2026/Girish Narayanswamy, Maxwell A. Xu, A. Ali Heydari et al./ 32% relevance

Current researchAutomated pullcs.AI-2026-05-22.json

More Context, Larger Models, or Moral Knowledge? A Systematic Study of Schwartz Value Detection in Political Texts

Detecting Schwartz values in political text is difficult because implicit cues often depend on surrounding arguments and fine-grained distinctions between neighboring values. We study when context and explicit moral knowledge help sentence-level value detection. Using the ValuesML/Touch{é} ValueEval format, we compare sentence, window, and full-document inputs; no-RAG and retrieval-augmented settings with a curated moral knowledge base; supervised DeBERTa-v3-base/large encoders; and zero-shot LLMs from 12B to 123B parameters. The results show that more context is not uniformly better: full-document context improves supervised DeBERTa encoders by 3.8--4.8 macro-F1 points over sentence-only input, but does not consistently help zero-shot LLMs. Retrieved moral knowledge is more consistently useful in matched comparisons, improving each tested model family and context condition under early fusion. However, scaling from DeBERTa-v3-base to large and from 12B to larger LLMs does not guarantee gains, and simple early fusion outperforms the tested late-fusion and cross-attention RAG variants for encoders. Per-value analyses show that context and retrieval help most for socially situated or conceptually confusable values. These findings suggest that value-sensitive NLP should evaluate context, knowledge, and model family jointly rather than treating longer inputs or larger models as universal improvements.

5/21/2026/Víctor Yeste, Paolo Rosso/ 32% relevance

Current researchAutomated pullcs.AI-2026-05-22.json

Think Thrice Before You Speak: Dual knowledge-enhanced Theory-of-Mind Reasoning for Persuasive Agents

Persuasive dialogue requires reasoning about others' latent mental states, a capability known as Theory of Mind (ToM). However, due to reliance on simple prompting strategies and insufficient ToM knowledge, existing LLMs often fail to capture the intrinsic dependencies among mental states, leading to fragmented representations and unstable reasoning. To address these challenges, we introduce the ToM-based Persuasive Dialogue (ToM-PD) task, grounded in the Belief-Desire-Intention (BDI) framework, which explicitly models the sequential dependencies among mental states in multi-turn dialogues. To facilitate research on this task, we construct a large-scale annotated dataset, ToM-based Broad Persuasive Dialogues (ToM-BPD), capturing fine-grained mental states and corresponding persuasive strategies. We further propose Think Thrice Before You Speak (TTBYS), a knowledge-enhanced stepwise reasoning framework that leverages both explicit and implicit prior experiences to improve LLMs' inference of desires, beliefs, and persuasive strategies. Experimental results demonstrate that Qwen3-8B equipped with TTBYS outperforms GPT-5 by 1.20%, 22.80%, and 16.97% in predicting desires, beliefs, and persuasive strategies, respectively. Case studies further show that our approach enhances interpretability and consistency in reasoning.

5/21/2026/Minghui Ma, Bin Guo, Runze Yang et al./ 32% relevance

Needs reviewNeeds reviewcs.AI-2025-11-17.json

iMAD: Intelligent Multi-Agent Debate for Efficient and Accurate LLM Inference

Large Language Model (LLM) agent systems have advanced rapidly, driven by their strong generalization in zero-shot settings. To further enhance reasoning and accuracy on complex tasks, Multi-Agent Debate (MAD) has emerged as a promising framework that engages multiple LLM agents in structured debates to encourage diverse reasoning. However, triggering MAD for every query is inefficient, as it incurs substantial computational (token) cost and may even degrade accuracy by overturning correct single-agent answers. To address these limitations, we propose intelligent Multi-Agent Debate (iMAD), a token-efficient framework that selectively triggers MAD only when it is likely to be beneficial (i.e., correcting an initially wrong answer). To achieve this goal, iMAD learns generalizable model behaviors to make accurate debate decisions. Specifically, iMAD first prompts a single agent to produce a structured self-critique response, from which we extract 41 interpretable linguistic and semantic features capturing hesitation cues. Then, iMAD uses a lightweight debate-decision classifier, trained using our proposed FocusCal loss, to determine whether to trigger MAD, enabling robust debate decisions without test dataset-specific tuning. Through extensive experiments using six (visual) question answering datasets against five competitive baselines, we have shown that iMAD significantly reduces token usage (by up to 92%) while also improving final answer accuracy (by up to 13.5%).

11/14/2025/Wei Fan, JinYi Yoon, Bo Ji/ 32% relevance

Needs reviewNeeds reviewcs.AI-2025-11-17.json

UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios

Autonomous aerial systems increasingly rely on large language models (LLMs) for mission planning, perception, and decision-making, yet the lack of standardized and physically grounded benchmarks limits systematic evaluation of their reasoning capabilities. To address this gap, we introduce UAVBench, an open benchmark dataset comprising 50,000 validated UAV flight scenarios generated through taxonomy-guided LLM prompting and multi-stage safety validation. Each scenario is encoded in a structured JSON schema that includes mission objectives, vehicle configuration, environmental conditions, and quantitative risk labels, providing a unified representation of UAV operations across diverse domains. Building on this foundation, we present UAVBench_MCQ, a reasoning-oriented extension containing 50,000 multiple-choice questions spanning ten cognitive and ethical reasoning styles, ranging from aerodynamics and navigation to multi-agent coordination and integrated reasoning. This framework enables interpretable and machine-checkable assessment of UAV-specific cognition under realistic operational contexts. We evaluate 32 state-of-the-art LLMs, including GPT-5, ChatGPT-4o, Gemini 2.5 Flash, DeepSeek V3, Qwen3 235B, and ERNIE 4.5 300B, and find strong performance in perception and policy reasoning but persistent challenges in ethics-aware and resource-constrained decision-making. UAVBench establishes a reproducible and physically grounded foundation for benchmarking agentic AI in autonomous aerial systems and advancing next-generation UAV reasoning intelligence. To support open science and reproducibility, we release the UAVBench dataset, the UAVBench_MCQ benchmark, evaluation scripts, and all related materials on GitHub at https://github.com/maferrag/UAVBench

11/14/2025/Mohamed Amine Ferrag, Abderrahmane Lakas, Merouane Debbah/ 37% relevance

Citable Reference Library

Year	Reference	Status	Source
2025	Copyright and Artificial Intelligence U.S. Copyright Office Report on Copyright and Artificial Intelligence	Canonical	Source
2024	Mixtral of Experts Jiang, A. Q., Sablayrolles, A., Roux, A., Mensch, A., et al. arXiv preprint	Current research	arXiv:2401.04088
2024	The Claude Model Card and Evaluations Anthropic Anthropic Technical Report	Industry-reported	Source
2024	The Llama 3 Herd of Models Dubey, A., Jauhri, A., Pandey, A., et al. arXiv preprint	Current research	arXiv:2407.21783
2024	Video Generation Models as World Simulators OpenAI OpenAI Technical Report	Industry-reported	Source
2023	Gemini: A Family of Highly Capable Multimodal Models Gemini Team, Google DeepMind arXiv preprint	Current research	arXiv:2312.11805
2023	Generative AI at Work Brynjolfsson, E., Li, D., & Raymond, L. R. NBER Working Paper No. 31161	Canonical	Source
2023	GPT-4 Technical Report OpenAI arXiv preprint	Current research	arXiv:2303.08774
2023	GPT-4V(ision) System Card Achiam, J., Adler, S., Agarwal, S., et al. OpenAI Technical Report	Industry-reported	Source
2023	Llama 2: Open Foundation and Fine-Tuned Chat Models Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., et al. arXiv preprint	Current research	arXiv:2307.09288
2023	Mamba: Linear-Time Sequence Modeling with Selective State Spaces Gu, A., & Dao, T. arXiv preprint	Current research	arXiv:2312.00752
2023	Reflexion: Language Agents with Verbal Reinforcement Learning Shinn, N., Cassano, F., Gopinath, A., Narasimhan, K., & Yao, S. NeurIPS 2023	Current research	arXiv:2303.11366
2023	Toolformer: Language Models Can Teach Themselves to Use Tools Schick, T., Dwivedi-Yu, J., Dessì, R., Raileanu, R., Lomeli, M., et al. arXiv preprint	Current research	arXiv:2302.04761
2023	Visual Instruction Tuning Liu, H., Li, C., Wu, Q., & Lee, Y. J. NeurIPS 2023	Current research	arXiv:2304.08485
2022	Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Wei, J., Wang, X., Schuurmans, D., Bosma, M., Ichter, B., et al. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:2201.11903
2022	FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness Dao, T., Fu, D. Y., Ermon, S., Rudra, A., & Ré, C. NeurIPS 2022	Canonical	arXiv:2205.14135
2022	Large Language Models Are Human-Level Prompt Engineers Zhou, Y., Muresanu, A. I., Han, Z., Paster, K., Pitis, S., et al. ICLR 2023	Canonical	arXiv:2211.01910
2022	ReAct: Synergizing Reasoning and Acting in Language Models Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., et al. ICLR 2023	Canonical	arXiv:2210.03629
2022	Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback Bai, Y., Jones, A., Ndousse, K., Askell, A., Chen, A., et al. arXiv preprint	Canonical	arXiv:2204.05862
2022	Training Compute-Optimal Large Language Models Hoffmann, J., Borgeaud, S., Mensch, A., Buchatskaya, E., Cai, T., et al. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:2203.15556
2022	Training language models to follow instructions with human feedback Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., et al. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:2203.02155
2021	An Overview of Catastrophic AI Risks Hendrycks, D., Mazeika, M., & Woodside, T. arXiv preprint	Canonical	arXiv:2306.12001
2021	Diffusion Models Beat GANs on Image Synthesis Dhariwal, P., & Nichol, A. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:2105.05233
2021	High-Resolution Image Synthesis with Latent Diffusion Models Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. CVPR 2022	Canonical	arXiv:2112.10752
2021	LoRA: Low-Rank Adaptation of Large Language Models Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., et al. ICLR 2022	Canonical	arXiv:2106.09685
2021	On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? Bender, E. M., Gebru, T., McMillan-Major, A., & Shmitchell, S. FAccT 2021	Canonical	DOI:10.1145/3442188.3445922
2021	Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm Reynolds, L., & McDonell, K. CHI EA 2021	Canonical	arXiv:2102.07350
2020	An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., et al. ICLR 2021	Canonical	arXiv:2010.11929
2020	Denoising Diffusion Probabilistic Models Ho, J., Jain, A., & Abbeel, P. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:2006.11239
2020	Dense Passage Retrieval for Open-Domain Question Answering Karpukhin, V., Oğuz, B., Min, S., Lewis, P., Wu, L., et al. EMNLP 2020	Canonical	arXiv:2004.04906
2020	Language Models are Few-Shot Learners Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., et al. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:2005.14165
2020	Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., et al. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:2005.11401
2020	Scaling Laws for Neural Language Models Kaplan, J., McCandlish, S., Henighan, T., Brown, T. B., Chess, B., et al. arXiv preprint	Canonical	arXiv:2001.08361
2020	Score-Based Generative Modeling through Stochastic Differential Equations Song, Y., Sohl-Dickstein, J., Kingma, D. P., Kumar, A., Ermon, S., & Poole, B. ICLR 2021	Canonical	arXiv:2011.13456
2019	Fairness and Abstraction in Sociotechnical Systems Selbst, A. D., Boyd, D., Friedler, S. A., Venkatasubramanian, S., & Vertesi, J. FAT* 2019	Canonical	DOI:10.1145/3287560.3287598
2019	Human Compatible: Artificial Intelligence and the Problem of Control Russell, S. Viking Press	Canonical	Source
2019	Language Models are Unsupervised Multitask Learners Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., & Sutskever, I. OpenAI Technical Report	Industry-reported	Source
2019	Model Cards for Model Reporting Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., et al. FAT* 2019	Canonical	arXiv:1810.03993
2019	What is the evidence on the role of the arts in improving health and well-being? A scoping review Fancourt, D., & Finn, S. World Health Organization Regional Office for Europe	Canonical	Source
2018	BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. NAACL-HLT 2019	Canonical	arXiv:1810.04805
2018	Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification Buolamwini, J., & Gebru, T. Conference on Fairness, Accountability and Transparency (FAccT)	Canonical	Source
2018	Reinforcement Learning: An Introduction (2nd Edition) Sutton, R. S., & Barto, A. G. MIT Press	Canonical	Source
2018	Universal Language Model Fine-tuning for Text Classification Howard, J., & Ruder, S. ACL 2018	Canonical	arXiv:1801.06146
2017	Attention Is All You Need Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:1706.03762
2017	CAN: Creative Adversarial Networks, Generating Art by Learning About Styles and Deviating from Style Norms Elgammal, A., Liu, B., Elhoseiny, M., & Mazzone, M. International Conference on Computational Creativity	Canonical	arXiv:1706.07068
2017	Deep Reinforcement Learning from Human Preferences Christiano, P. F., Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	arXiv:1706.03741
2017	Proximal Policy Optimization Algorithms Schulman, J., Wolski, F., Dhariwal, P., Radford, A., & Klimov, O. arXiv preprint	Canonical	arXiv:1707.06347
2016	Concrete Problems in AI Safety Amodei, D., Olah, C., Steinhardt, J., Christiano, P., Schulman, J., & Mané, D. arXiv preprint	Canonical	arXiv:1606.06565
2016	Deep Learning Goodfellow, I., Bengio, Y., & Courville, A. MIT Press	Canonical	Source
2016	Mastering the game of Go with deep neural networks and tree search Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., et al. Nature, 529(7587), 484-489	Canonical	DOI:10.1038/nature16961
2015	Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift Ioffe, S., & Szegedy, C. ICML 2015	Canonical	arXiv:1502.03167
2015	Deep Residual Learning for Image Recognition He, K., Zhang, X., Ren, S., & Sun, J. CVPR 2016	Canonical	arXiv:1512.03385
2014	Adam: A Method for Stochastic Optimization Kingma, D. P., & Ba, J. ICLR 2015	Canonical	arXiv:1412.6980
2014	Going Deeper with Convolutions Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., et al. CVPR 2015	Canonical	arXiv:1409.4842
2014	Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., et al. EMNLP 2014	Canonical	arXiv:1406.1078
2014	Neural Machine Translation by Jointly Learning to Align and Translate Bahdanau, D., Cho, K., & Bengio, Y. ICLR 2015	Canonical	arXiv:1409.0473
2014	Superintelligence: Paths, Dangers, Strategies Bostrom, N. Oxford University Press	Canonical	Source
2014	Very Deep Convolutional Networks for Large-Scale Image Recognition Simonyan, K., & Zisserman, A. ICLR 2015	Canonical	arXiv:1409.1556
2013	Playing Atari with Deep Reinforcement Learning Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., et al. NIPS Deep Learning Workshop	Canonical	arXiv:1312.5602
2012	ImageNet Classification with Deep Convolutional Neural Networks Krizhevsky, A., Sutskever, I., & Hinton, G. E. Advances in Neural Information Processing Systems (NeurIPS)	Canonical	DOI:10.1145/3065386
2012	Improving neural networks by preventing co-adaptation of feature detectors Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. R. arXiv preprint	Canonical	arXiv:1207.0580
2010	The Connection Between Art, Healing, and Public Health: A Review of Current Literature Stuckey, H. L., & Nobel, J. American Journal of Public Health, 100(2), 254-263	Canonical	DOI:10.2105/AJPH.2008.156497
2006	Pattern Recognition and Machine Learning Bishop, C. M. Springer	Canonical	Source
1997	Long Short-Term Memory Hochreiter, S., & Schmidhuber, J. Neural Computation, 9(8), 1735-1780	Canonical	DOI:10.1162/neco.1997.9.8.1735
1989	Backpropagation Applied to Handwritten Zip Code Recognition LeCun, Y., Boser, B., Denker, J. S., Henderson, D., Howard, R. E., Hubbard, W., & Jackel, L. D. Neural Computation, 1(4), 541-551	Canonical	DOI:10.1162/neco.1989.1.4.541
1986	Learning representations by back-propagating errors Rumelhart, D. E., Hinton, G. E., & Williams, R. J. Nature, 323(6088), 533-536	Canonical	DOI:10.1038/323533a0
1982	Art Worlds Becker, H. S. University of California Press	Canonical	Source
1958	The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain Rosenblatt, F. Psychological Review, 65(6), 386-408	Canonical	DOI:10.1037/h0042519
1955	A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence McCarthy, J., Minsky, M., Rochester, N., & Shannon, C. AI Magazine, 27(4)	Canonical	Source
1950	Computing Machinery and Intelligence Turing, A. M. Mind, 59(236), 433-460	Canonical	DOI:10.1093/mind/LIX.236.433
1935	The Work of Art in the Age of Mechanical Reproduction Benjamin, W. Essay	Canonical	Source

Current researchAutomated pullcs.AI-2026-05-22.json

Towards a General Intelligence and Interface for Wearable Health Data

5/21/2026/Girish Narayanswamy, Maxwell A. Xu, A. Ali Heydari et al./ 32% relevance

Year

Reference

Status

Source

2025

U.S. Copyright Office

Report on Copyright and Artificial Intelligence

Canonical

2024

Mixtral of Experts

Jiang, A. Q., Sablayrolles, A., Roux, A., Mensch, A., et al.

arXiv preprint

Current research

arXiv:2401.04088

2024

The Claude Model Card and Evaluations

Anthropic

Anthropic Technical Report

Industry-reported

2024

The Llama 3 Herd of Models

Dubey, A., Jauhri, A., Pandey, A., et al.

arXiv preprint

Current research

arXiv:2407.21783

2024

Video Generation Models as World Simulators

OpenAI

OpenAI Technical Report

Industry-reported

2023

Gemini: A Family of Highly Capable Multimodal Models

Gemini Team, Google DeepMind

arXiv preprint

Current research

arXiv:2312.11805

2023

Generative AI at Work

Brynjolfsson, E., Li, D., & Raymond, L. R.

NBER Working Paper No. 31161

Canonical

2023

GPT-4 Technical Report

OpenAI

arXiv preprint

Current research

arXiv:2303.08774

2023

GPT-4V(ision) System Card

Achiam, J., Adler, S., Agarwal, S., et al.

OpenAI Technical Report

Industry-reported

DOI:10.1145/3442188.3445922

2023

Llama 2: Open Foundation and Fine-Tuned Chat Models

Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., et al.

arXiv preprint

Current research

arXiv:2307.09288

2023

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Gu, A., & Dao, T.

arXiv preprint

Current research

arXiv:2312.00752

2023

Reflexion: Language Agents with Verbal Reinforcement Learning

Shinn, N., Cassano, F., Gopinath, A., Narasimhan, K., & Yao, S.

NeurIPS 2023

Current research

arXiv:2303.11366

2023

Toolformer: Language Models Can Teach Themselves to Use Tools

Schick, T., Dwivedi-Yu, J., Dessì, R., Raileanu, R., Lomeli, M., et al.

arXiv preprint

Current research

arXiv:2302.04761

2023

Visual Instruction Tuning

Liu, H., Li, C., Wu, Q., & Lee, Y. J.

NeurIPS 2023

Current research

arXiv:2304.08485

2022

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Wei, J., Wang, X., Schuurmans, D., Bosma, M., Ichter, B., et al.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:2201.11903

2022

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Dao, T., Fu, D. Y., Ermon, S., Rudra, A., & Ré, C.

NeurIPS 2022

Canonical

arXiv:2205.14135

2022

Large Language Models Are Human-Level Prompt Engineers

Zhou, Y., Muresanu, A. I., Han, Z., Paster, K., Pitis, S., et al.

ICLR 2023

Canonical

arXiv:2211.01910

2022

ReAct: Synergizing Reasoning and Acting in Language Models

Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., et al.

ICLR 2023

Canonical

arXiv:2210.03629

2022

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Bai, Y., Jones, A., Ndousse, K., Askell, A., Chen, A., et al.

arXiv preprint

Canonical

arXiv:2204.05862

2022

Training Compute-Optimal Large Language Models

Hoffmann, J., Borgeaud, S., Mensch, A., Buchatskaya, E., Cai, T., et al.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:2203.15556

2022

Training language models to follow instructions with human feedback

Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., et al.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:2203.02155

2021

An Overview of Catastrophic AI Risks

Hendrycks, D., Mazeika, M., & Woodside, T.

arXiv preprint

Canonical

arXiv:2306.12001

2021

Diffusion Models Beat GANs on Image Synthesis

Dhariwal, P., & Nichol, A.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:2105.05233

2021

High-Resolution Image Synthesis with Latent Diffusion Models

Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B.

CVPR 2022

Canonical

arXiv:2112.10752

2021

LoRA: Low-Rank Adaptation of Large Language Models

Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., et al.

ICLR 2022

Canonical

arXiv:2106.09685

2021

On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?

Bender, E. M., Gebru, T., McMillan-Major, A., & Shmitchell, S.

FAccT 2021

Canonical

2021

Prompt Programming for Large Language Models: Beyond the Few-Shot Paradigm

Reynolds, L., & McDonell, K.

CHI EA 2021

Canonical

arXiv:2102.07350

2020

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., et al.

ICLR 2021

Canonical

arXiv:2010.11929

2020

Denoising Diffusion Probabilistic Models

Ho, J., Jain, A., & Abbeel, P.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:2006.11239

2020

Dense Passage Retrieval for Open-Domain Question Answering

Karpukhin, V., Oğuz, B., Min, S., Lewis, P., Wu, L., et al.

EMNLP 2020

Canonical

arXiv:2004.04906

2020

Language Models are Few-Shot Learners

Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., et al.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:2005.14165

2020

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., et al.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:2005.11401

2020

Scaling Laws for Neural Language Models

Kaplan, J., McCandlish, S., Henighan, T., Brown, T. B., Chess, B., et al.

arXiv preprint

Canonical

arXiv:2001.08361

2020

Score-Based Generative Modeling through Stochastic Differential Equations

Song, Y., Sohl-Dickstein, J., Kingma, D. P., Kumar, A., Ermon, S., & Poole, B.

ICLR 2021

Canonical

arXiv:2011.13456

2019

Fairness and Abstraction in Sociotechnical Systems

Selbst, A. D., Boyd, D., Friedler, S. A., Venkatasubramanian, S., & Vertesi, J.

FAT* 2019

Canonical

DOI:10.1145/3287560.3287598

2019

Human Compatible: Artificial Intelligence and the Problem of Control

Russell, S.

Viking Press

Canonical

2019

Language Models are Unsupervised Multitask Learners

Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., & Sutskever, I.

OpenAI Technical Report

Industry-reported

2019

Model Cards for Model Reporting

Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., et al.

FAT* 2019

Canonical

arXiv:1810.03993

2019

What is the evidence on the role of the arts in improving health and well-being? A scoping review

Fancourt, D., & Finn, S.

World Health Organization Regional Office for Europe

Canonical

2018

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Devlin, J., Chang, M. W., Lee, K., & Toutanova, K.

NAACL-HLT 2019

Canonical

arXiv:1810.04805

2018

Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification

Buolamwini, J., & Gebru, T.

Conference on Fairness, Accountability and Transparency (FAccT)

Canonical

2018

Reinforcement Learning: An Introduction (2nd Edition)

Sutton, R. S., & Barto, A. G.

MIT Press

Canonical

2018

Universal Language Model Fine-tuning for Text Classification

Howard, J., & Ruder, S.

ACL 2018

Canonical

arXiv:1801.06146

2017

Attention Is All You Need

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:1706.03762

2017

CAN: Creative Adversarial Networks, Generating Art by Learning About Styles and Deviating from Style Norms

Elgammal, A., Liu, B., Elhoseiny, M., & Mazzone, M.

International Conference on Computational Creativity

Canonical

arXiv:1706.07068

2017

Deep Reinforcement Learning from Human Preferences

Christiano, P. F., Leike, J., Brown, T., Martic, M., Legg, S., & Amodei, D.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

arXiv:1706.03741

2017

Proximal Policy Optimization Algorithms

Schulman, J., Wolski, F., Dhariwal, P., Radford, A., & Klimov, O.

arXiv preprint

Canonical

arXiv:1707.06347

2016

Concrete Problems in AI Safety

Amodei, D., Olah, C., Steinhardt, J., Christiano, P., Schulman, J., & Mané, D.

arXiv preprint

Canonical

arXiv:1606.06565

2016

Deep Learning

Goodfellow, I., Bengio, Y., & Courville, A.

MIT Press

Canonical

2016

Mastering the game of Go with deep neural networks and tree search

Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., et al.

Nature, 529(7587), 484-489

Canonical

DOI:10.1038/nature16961

2015

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Ioffe, S., & Szegedy, C.

ICML 2015

Canonical

arXiv:1502.03167

2015

Deep Residual Learning for Image Recognition

He, K., Zhang, X., Ren, S., & Sun, J.

CVPR 2016

Canonical

arXiv:1512.03385

2014

Adam: A Method for Stochastic Optimization

Kingma, D. P., & Ba, J.

ICLR 2015

Canonical

arXiv:1412.6980

2014

Going Deeper with Convolutions

Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., et al.

CVPR 2015

Canonical

arXiv:1409.4842

2014

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., et al.

EMNLP 2014

Canonical

arXiv:1406.1078

2014

Neural Machine Translation by Jointly Learning to Align and Translate

Bahdanau, D., Cho, K., & Bengio, Y.

ICLR 2015

Canonical

arXiv:1409.0473

2014

Superintelligence: Paths, Dangers, Strategies

Bostrom, N.

Oxford University Press

Canonical

DOI:10.2105/AJPH.2008.156497

2014

Very Deep Convolutional Networks for Large-Scale Image Recognition

Simonyan, K., & Zisserman, A.

ICLR 2015

Canonical

arXiv:1409.1556

2013

Playing Atari with Deep Reinforcement Learning

Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., et al.

NIPS Deep Learning Workshop

Canonical

arXiv:1312.5602

2012

ImageNet Classification with Deep Convolutional Neural Networks

Krizhevsky, A., Sutskever, I., & Hinton, G. E.

Advances in Neural Information Processing Systems (NeurIPS)

Canonical

DOI:10.1145/3065386

2012

Improving neural networks by preventing co-adaptation of feature detectors

Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. R.

arXiv preprint

Canonical

arXiv:1207.0580

2010

The Connection Between Art, Healing, and Public Health: A Review of Current Literature

Stuckey, H. L., & Nobel, J.

American Journal of Public Health, 100(2), 254-263

Canonical

2006

Pattern Recognition and Machine Learning

Bishop, C. M.

Springer

Canonical

DOI:10.1162/neco.1997.9.8.1735

1997

Long Short-Term Memory

Hochreiter, S., & Schmidhuber, J.

Neural Computation, 9(8), 1735-1780

Canonical

1989

Backpropagation Applied to Handwritten Zip Code Recognition

LeCun, Y., Boser, B., Denker, J. S., Henderson, D., Howard, R. E., Hubbard, W., & Jackel, L. D.

Neural Computation, 1(4), 541-551

Canonical

DOI:10.1162/neco.1989.1.4.541

1986

Learning representations by back-propagating errors

Rumelhart, D. E., Hinton, G. E., & Williams, R. J.

Nature, 323(6088), 533-536

Canonical

DOI:10.1038/323533a0

1982

Art Worlds

Becker, H. S.

University of California Press

Canonical

1958

The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain

Rosenblatt, F.

Psychological Review, 65(6), 386-408

Canonical

DOI:10.1037/h0042519

1955

A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence

McCarthy, J., Minsky, M., Rochester, N., & Shannon, C.

AI Magazine, 27(4)

Canonical

DOI:10.1093/mind/LIX.236.433

1950

Computing Machinery and Intelligence

Turing, A. M.

Mind, 59(236), 433-460

Canonical

1935

The Work of Art in the Age of Mechanical Reproduction

Benjamin, W.

Essay

Canonical