MediaWiki has been successfully installed.
Consult the User's Guide for information on using the wiki software.
Welcome to the THYME project
Welcome to the Temporal Histories of Your Medical Event (THYME) project.
The overarching long-term vision of our research is to create novel technologies for processing clinical free text. Such technologies will enable sophisticated and efficient indexing, retrieval and data mining over the ever increasing amounts of electronic clinical data. Processing free text poses a number of challenges to which the fields of Artificial intelligence, natural language processing and computer science in general have made advances. Methods for processing free text are informed by linguistic theory combined with the power of statistical inferencing. A key component to the next step, natural language understanding, is discovering events and their relations on a timeline. Temporal relations are of prime importance in biomedicine as they are intrinsically linked to diseases, signs and symptoms, and treatments. Understanding the timeline of clinically relevant events is key to the next generation of translational research where the importance of generalizing over large amounts of data holds the promise of deciphering biomedical puzzles.
The goal of our current proposal is to discover temporal relations from clinical free text through achieving four specific aims:
Specific Aim 1: Develop (1) a temporal relation annotation schema and guidelines for clinical free text based on TimeML, which will require extensions to Treebank, PropBank and VerbNet annotation guidelines to the clinical domain, (2) an annotated corpus following the temporal relations schema with additions to Treebank, PropBank and VerbNet, (3) a descriptive study comparing temporal relations in the clinical and general domains.
Specific Aim 2: Extend and evaluate existing methods and/or develop new algorithms for temporal relation discovery in the clinical domain. Component-level evaluation
Specific Aim 3: Integrate best method and/or a variety of methods for temporal relation discovery into the cTAKES and release as open source annotators in the pipeline. Functional testing. Dissemination activities.
Specific Aim 4: System-level evaluation. Test the functionality of the enhanced cTAKES on translational research use cases, e.g. the progression of colon cancer as documented in clinical notes and pathology reports, the progression of brain tumor as documented in radiology reports.
The methods we will use for the temporal relation discovery are based on machine learning, e.g., Support Vector Machine technology. Such methods require the annotation of a reference standard from which the computations are derived. The best methods will be released as part of the cTAKES for the larger community to use and contribute to. We will test the methods against biomedical queries.
Who We Are
- University of Colorado
- Martha Palmer (PI)
- Jim Martin
- Wayne Ward
- Steven Bethard
- William Styler
- Arrick Lanfranchi
- Anwen Fredricsen
- Childrens Hospital Boston/Harvard Medical School
- Guergana Savova (PI)
- Dmitriy Dligach
- Timothy Miller
- Glenn Zaramba
- David Harris
- Mayo Clinic
- Piet de Groen
- Brad Erickson
- James Masanz
- Donna Ihrke
- Brandeis University
- James Pustejovsky
Clinical Temporal Relations Annotation Guidelines - Release notes and latest versions
Annotations - Describes the corpus, the layers of annotations and annotation progress
- Bi-weekly meetings, Wed 11-noon ET
- Call in details
- Feb 1, 2012 Agenda and notes
If you need assistance, feel free to send e-mail to daryl.lonnon at colorado dot edu