CONTEXT: This article presents a systematic and comprehensive literature review of agricultural research for development (AR4D) programs. OBJECTIVE: The review aims to distil commonly mentioned outcomes of AR4D programs and intervention, their causal conditions, and their causal relationships. The review also seeks to unpack what complex causation means in the context of AR4D. METHOD: Following PRISMA guidelines, the review covers the period from 1980 to June 2023 and includes a meticulous selection of peer-reviewed journal articles, books, chapters, and grey literature (n = 57 from an initial sample of n = 427). RESULTS AND CONCLUSIONS: The findings reveal a clear and limited set of outcomes and conditions, highlighting coherence and manageability within the field. However, concerns arise regarding the representation of real-world AR4D programs, with a bias towards reporting positive outcomes and successful initiatives while overlooking less successful or failing programs, particularly from Latin America and Central Asia. Complex causation emerges as a recurrent theme, emphasizing the need for innovative research methods to understand the intricate relationships between outcomes and multiple contributing factors. Furthermore, scaling successful programs is a pressing topic, challenging assumptions of replicability and calling for a comprehensive understanding of scaling processes.