TY - GEN
T1 - Automatic Table ground truth generation and a background-analysis-based table structure extraction method
AU - Wang, Yalin
AU - Phillips, Ihsin T.
AU - Haralick, Robert
N1 - Publisher Copyright:
© 2001 IEEE.
PY - 2001
Y1 - 2001
N2 - In this paper, we first describe an automatic table ground truth generation system which can efficiently generate a large amount of accurate table ground truth suitable for the development of table detection algorithms. Then a novel background-analysis-based, coarse-to-fine table identification algorithm and an X-Y cut table decomposition algorithm are described. We discuss an experimental protocol to evaluate the table detection algorithms. For a total of 1,125 document pages having 518 table entities and a total of 10,941 cell entities, our table detection algorithm takes line, word segmentation results as input and obtains around 90% cell correct detection rates.
AB - In this paper, we first describe an automatic table ground truth generation system which can efficiently generate a large amount of accurate table ground truth suitable for the development of table detection algorithms. Then a novel background-analysis-based, coarse-to-fine table identification algorithm and an X-Y cut table decomposition algorithm are described. We discuss an experimental protocol to evaluate the table detection algorithms. For a total of 1,125 document pages having 518 table entities and a total of 10,941 cell entities, our table detection algorithm takes line, word segmentation results as input and obtains around 90% cell correct detection rates.
UR - http://www.scopus.com/inward/record.url?scp=84951779521&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84951779521&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2001.953845
DO - 10.1109/ICDAR.2001.953845
M3 - Conference contribution
AN - SCOPUS:84951779521
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 528
EP - 532
BT - Proceedings - 6th International Conference on Document Analysis and Recognition, ICDAR 2001
PB - IEEE Computer Society
T2 - 6th International Conference on Document Analysis and Recognition, ICDAR 2001
Y2 - 10 September 2001 through 13 September 2001
ER -