DLIA2003 Technical Program

08:30-09:00 Registration
09:00-09:30 Workshop Opening and Special Presentation
Document Layout Problems Facing the Aerospace Industry
Lawrence S. Baum, John H. Boose, Molly Boose, Carey S. Chaplin, James Cheung, Ole B. Larsen, Monica Rosman Lafever, Ronald C. Provine, David Shema
Boeing Phantom Works
09:30-10:15 Paper Presentations I
  Stochastic Modeling and Learning:

    Poorly Structured Handwritten Documents Segmentation using Continuous Probabilistic Feature Grammars
    T. Artières
    LIP6, Université Paris
    Mining spatial association rules from document layout structures
    Margherita Berardi, Michelangelo Ceci, Donato Malerba
    Dipartimento di Informatica, Università degli Studi di Bari
    Selection of table areas for information extraction
    Ana Costa e Silva, Alípio Jorge, Luís Torgo
    Banco de Portugal, Universidade do Porto, Universidade do Porto
10:15-10:30 Coffee
10:30-11:15 Paper Presentations II
  Document Structure Understanding:

    Indexing and Retrieval of Document Images Using Term Positions and Physical Structures
    Koichi Kise, Keinosuke Matsumoto
    Dept. of Computer and Systems Sciences, Osaka Prefecture University
    Layout Analysis based on Text Line Segment Hypotheses
    Thomas M. Breuel
    Palo Alto Research Center (PARC)
    Assuming Accurate Layout Information for Web Documents is Available, What Now?
    Hassan Alam, Rachmat Hartono, Aman Kumar, Fuad Rahman, Yuliya Tarnikova and Che Wilcox
    BCL Technologies Inc.
11:15-12:15 Discussion Groups I:
  Group Ia: Stochastic Modeling
  Group Ib: Document Structure Understanding
12:15-13:15 Lunch
13:15-13:45 Demonstration: WISDOM++
Margherita Berardi, Michelangelo Ceci, Donato Malerba
Dipartimento di Informatica, Università degli Studi di Bari
13:45-14:30 Paper Presentations III
  Digital Document Interpretation:

    Ground-Truth Production and Benchmarking Scenarios Creation With DocMining
    Eric Clavier, Pierre Heroux, Joel Gardes, Eric Trupin
    FTR&D, FTR&D, PSI Laboratory, University of Rouen
    Assuming Accurate Layout Information is Available: How do we Interpret the Content Flow in HTML Documents?
    Hassan Alam and Fuad Rahman
    BCL Technologies Inc.
    Background pattern recognition in multi-page PDF document
    Hui Chao
    Hewlett-Packard Labs
14:30-15:30 Discussion Groups II:
  Group IIa: Digital Document Interpretation
  Group IIb: The Future of DLIA
15:30-15:45 Tea
15:45-16:45 Plenary Session: Reports from Discussion Chairs