Many education research questions cannot be answered using participants from one institution or short periods of follow-up. In response to societal demands for accountability and evidence of effectiveness, new models of research must be developed to study the outcomes of educational activities. Following the 2007 Millennium Conference on Medical Education Research, organizers assigned a task force to explore the use of longitudinal databases in education research. This article summarizes the task force's findings. Similar to the Framingham studies in clinical medicine, longitudinal databases assemble prospectively collected information to retrospectively answer questions of interest. Many studies using such databases have been published. The task force identified three general approaches to database-type research. First, institutions can obtain identified information from existing sources, link it with school-specific information and other identified information, deidentify it, and merge it with similar information from other collaborating schools. Second, researchers can obtain from existing sources deidentified information on large samples and explore associations within this dataset. Third, investigators can design and implement databases to prospectively collect trainee information over time and across multiple institutions for the purpose of education research. Although costly, such comprehensive, purpose-built databases would ensure the availability of information needed to answer a variety of medical education research questions. Millennium Conference participants believed that stakeholders should explore the funding and development of such prospective databases. In the meantime, education researchers should use existing sources of individualized learner data to better understand how to develop competent, compassionate clinicians.