
Vivian Beaumont Allen Professor of Biomedical Informatics, Columbia University
Chair, Department of Biomedical Informatics, Columbia University
Director, Medical Informatics Services, NewYork-Presbyterian
Hospital/Columbia
Senior Informatics Advisor, New York City Dept of
Health and Mental Hygiene
Co-chair, Meaningful Use Workgroup, HIT Policy Committee of the Office of the National
Coordinator of Health Information Technology
Biography
George Hripcsak, MD, MS, is
Vivian Beaumont Allen Professor and Chair of Columbia University’s Department
of Biomedical Informatics, Director of Medical Informatics Services for NewYork-Presbyterian Hospital, and Senior Informatics
Advisor at the New York City Department of Health and Mental Hygiene. Dr.
Hripcsak is a board-certified internist with degrees in chemistry, medicine,
and biostatistics. He led the effort to create the Arden Syntax, a language for
representing health knowledge that has become a national standard. Dr. Hripcsak’s current research focus is on the clinical
information stored in electronic health records. Using data mining techniques
such as machine learning and natural language processing, he is developing the
methods necessary to support clinical research and patient safety initiatives.
As Director of Medical Informatics Services, he oversees a 7000-user,
2.5-million-patient clinical information system and data repository. He is
currently co-chair of the Meaningful Use Workgroup of
HHS’s Office of the National Coordinator of Health Information Technology; it
defines the criteria by which health care providers collect incentives for
using electronic health records. Dr. Hripcsak was elected fellow of the
American College of Medical Informatics in 1995 and served on the Board of
Directors of the American Medical Informatics Association (AMIA). As chair of
the AMIA Standards Committee, he coordinated the medical-informatics community
response to the Department of Health and Human Services for the
health-informatics standards rules under the Health Insurance Portability and
Accountability Act of 1996. Dr. Hripcsak chaired the National Library of
Medicine’s Biomedical Library and Informatics Review Committee, and he is a
fellow of the American College of Medical Informatics and the New York Academy
of Medicine. He has published over 200 papers.
What Is Informatics?
My
research focuses on understanding and using the clinical information stored in the
electronic health record. This theme has several components:
1. Data mining and knowledge discovery (see “Discovering and
applying knowledge in clinical databases” project web site). Machine
learning and visualization are examples of techniques to uncover knowledge from
vast clinical databases. My work focuses on testing and extending existing
discovery methods to improve their performance on clinical databases. Important
issues include training set size, data accuracy, data completeness, and
representation (e.g., how to accommodate diagnostic data, which is nominal with
many categories). Recent work includes the use of non-linear time series
analysis to characterize the electronic health record. Here is a study of serum
glucose, where predictability quantified as mutual information reveals the
diurnal variation of glucose (evidenced by the ridges). See [Albers DJ,
Hripcsak G. A
statistical dynamics approach to the study of human health data: resolving
population scale diurnal variation in laboratory data. Physics Letters A
2010;374:1159-64] for a related study on creatinine.

2. Natural language processing. In most institutions,
the vast majority of the richly detailed clinical information is stored as
narrative text, which is not generally amenable to automated analysis. Natural
language processing can parse the narrative text, converting it to a structured
and coded format. See [Hripcsak G, Elhadad N, Chen C, Zhou L, Morrison FP. Using
empirical semantic correlation to interpret temporal assertions in clinical
texts. J Am Med Inform Assoc 2009;16:220-7] for a study of the degree to which the true time
of an event varies from what is stated in the patient record. It is illustrated
below, where 1 marks the time of the writing of the note, and 0 marks the
stated time.

3. Evaluation methodology. The complexity of clinical
data, the presence of inaccurate and missing values, and the large but
heterogeneous collection of patients conspire to make it difficult to draw
conclusions using traditional statistical methods. Bias that would not affect a
traditional randomized trial can overwhelm the true effect in a retrospective
study of the electronic medical record. Here is a short piece on measuring
agreement [Hripcsak G, Rothschild AS. Agreement, the F-measure, and reliability in information retrieval.
J Am Med Inform Assoc 2005;12:296-8].
5. Clinical demonstration. Demonstrating the
usefulness of the above methods is critical to gather support and to focus new
work in important areas. The methods can be applied to clinical research
(largely hypothesis refinement) and clinical care (by generating timely advice
and monitoring patient safety). Recent work has included syndromic
surveillance and pharmacovigilance. See [Hripcsak G, Soulakis ND, Li L, Morrison FP, Lai AM, Friedman C, Calman NS, Mostashari F. Syndromic surveillance using ambulatory electronic health
records. J Am Med Inform Assoc 2009;16:354-61] for a study of surveillance.
In
addition, I am studying next-generation electronic health records. Current
technology supports individual clinician tasks, such as documenting and
ordering, in a manner that is largely similar to that of traditional paper records.
Improved understanding of workflow, information needs, cognition, and the
science of collaboration can lead to improved systems that exploit human
abilities, facilitate teams, and disseminate expertise. Here is a recent
editorial [Shea S, Hripcsak G. Accelerating the use of
electronic health records in physician practices. NEJM 2010;362:192-5].
As Director of Medical Informatics Services
for NewYork-Presbyterian Hospital/Columbia, I
overseeing the clinical data warehouse, terminology, WebCIS,
immunization, infection control, and physician outreach and collaborate on
clinician documentation, health information exchange, and patient portals.
·
WebCIS Web-based clinical
information system
As Co-chair of the Meaningful Use Workgroup
of the HIT Policy Committee of the Office of the National Coordinator of Health
Information Technology, I participate in the definition of "meaningful
use," by which the HITECH Act encourages the adoption of electronic health
records to promote improved quality and efficiency of health care.
We
offer programs at all levels of informatics training, including PhDs, master's
degrees, postdoctoral fellowship, certificate training, and education for
students in medicine, nursing, dentistry, and public health. See www.dbmi.columbia.edu.
George
Hripcsak, MD, MS
622 West 168th Street, VC-5
New York, NY 10032
hripcsak@columbia.edu