TY - GEN
T1 - The DIEGO Lab graph based gene normalization system
AU - Sullivan, Ryan
AU - Leaman, Robert
AU - Gonzalez, Graciela
PY - 2011/12/1
Y1 - 2011/12/1
N2 - Gene entity normalization, the mapping of a gene mention in free text to a unique identifier, is one of the primary subtasks in the biomedical information extraction pipeline. Gene entity normalization provides many challenges, specifically with the high ambiguity of gene names and the many-to-many relationship between gene names and identifiers. Drawing inspiration from recent work in word sense disambiguation, this paper presents a gene entity normalization system based on entity relationship graphs. This system creates a concept graph from the possible entities and their relationships within a full-text document, and takes advantage of a node ranking algorithm to rank and score each potential candidate entity. This system is a prototype to represent a specific approach to gene normalization, and the results reflect this. However, this system demonstrates that the relationship graph-based approach, an approach grounded in a theoretical basis, can potentially be useful for gene normalization and possibly for the normalization of various biomedical entities.
AB - Gene entity normalization, the mapping of a gene mention in free text to a unique identifier, is one of the primary subtasks in the biomedical information extraction pipeline. Gene entity normalization provides many challenges, specifically with the high ambiguity of gene names and the many-to-many relationship between gene names and identifiers. Drawing inspiration from recent work in word sense disambiguation, this paper presents a gene entity normalization system based on entity relationship graphs. This system creates a concept graph from the possible entities and their relationships within a full-text document, and takes advantage of a node ranking algorithm to rank and score each potential candidate entity. This system is a prototype to represent a specific approach to gene normalization, and the results reflect this. However, this system demonstrates that the relationship graph-based approach, an approach grounded in a theoretical basis, can potentially be useful for gene normalization and possibly for the normalization of various biomedical entities.
UR - http://www.scopus.com/inward/record.url?scp=84857836615&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84857836615&partnerID=8YFLogxK
U2 - 10.1109/ICMLA.2011.140
DO - 10.1109/ICMLA.2011.140
M3 - Conference contribution
AN - SCOPUS:84857836615
SN - 9780769546070
T3 - Proceedings - 10th International Conference on Machine Learning and Applications, ICMLA 2011
SP - 78
EP - 83
BT - Proceedings - 10th International Conference on Machine Learning and Applications, ICMLA 2011
T2 - 10th International Conference on Machine Learning and Applications, ICMLA 2011
Y2 - 18 December 2011 through 21 December 2011
ER -