| 08:30-09:00 |
Registration |
| 09:00-09:30 |
Workshop Opening and Special Presentation |
|
Document Layout Problems Facing the Aerospace Industry
Lawrence S. Baum, John H. Boose, Molly Boose, Carey
S. Chaplin, James Cheung, Ole B. Larsen, Monica Rosman
Lafever, Ronald C. Provine, David Shema
Boeing Phantom Works
|
| 09:30-10:15 |
Paper Presentations I |
|
Stochastic Modeling and Learning:
Poorly Structured Handwritten
Documents Segmentation using Continuous Probabilistic Feature
Grammars
T. Artičres
LIP6, Université Paris
|
|
Mining spatial association rules from document layout structures
Margherita Berardi, Michelangelo Ceci, Donato Malerba
Dipartimento di Informatica Universitŕ degli Studi di Bari
|
|
Selection of table areas for information extraction
Ana Costa e Silva, Alípio Jorge, Luís Torgo
Banco de Portugal, Universidade do Porto, Universidade do Porto
|
| 10:15-10:30 |
Coffee |
| 10:30-11:15 |
Paper Presentations II
|
|
Document Structure Understanding:
Indexing and Retrieval of Document Images Using Term Positions and Physical Structures
Koichi Kise, Keinosuke Matsumoto
Dept. of Computer and Systems Sciences, Osaka Prefecture University
|
|
Layout Analysis based on Text Line Segment Hypotheses
Thomas M. Breuel
Palo Alto Research Center (PARC)
|
|
Assuming Accurate Layout Information for Web Documents is Available, What Now?
Hassan Alam, Rachmat Hartono, Aman Kumar, Fuad Rahman, Yuliya Tarnikova and Che Wilcox
BCL Technologies Inc.
|
| 11:15-12:15 |
Discussion Groups I:
Group Ia: Stochastic Modeling
Group Ib: Document Structure Understanding
|
| 12:15-13:15 |
Lunch |
13:15-13:45 |
Demonstration: WISDOM++
Margherita Berardi, Michelangelo Ceci, Donato Malerba
Dipartimento di Informatica Universitŕ degli Studi di Bari |
| 13:45-14:30 |
Paper Presentations III |
|
Digital Document Interpretation:
Ground-Truth Production and Benchmarking Scenarios Creation With DocMining
Eric Clavier, Pierre Heroux, Joel Gardes, Eric Trupin
FTR&D, FTR&D, PSI Laboratory, University of Rouen
|
|
Assuming Accurate Layout Information is Available: How do we Interpret the Content Flow in HTML Documents?
Hassan Alam and Fuad Rahman
BCL Technologies Inc.
|
|
Background pattern recognition in multi-page PDF document
Hui Chao
Hewlett-Packard Labs
|
| 14:30-15:30 |
Discussion Groups II:
Group IIa: Digital Document Interpretation
Group IIb: The Future of DLIA
|
| 15:30-15:45 |
Tea |
| 15:45-16:45 |
Plenary Session: Reports from Discussion Chairs |