DI 2022 : DI 2022: Document Intelligence Workshop @ KDD 2022
DI 2022 : DI 2022: Document Intelligence Workshop @ KDD 2022

DI 2022 : DI 2022: Document Intelligence Workshop @ KDD 2022

Washington DC
Event Date: August 14, 2022 - August 14, 2022
Submission Deadline: June 09, 2022
Notification of Acceptance: June 20, 2022

Call for Papers

Call for Papers

Document Intelligence Workshop @ KDD 2022


Business documents are central to the operation of all organizations, and they come in all shapes and sizes: project reports, planning documents, technical specifications, financial statements, meeting minutes, legal agreements, contracts, resumes, purchase orders, invoices, and many more. The ability to read, understand and interpret these documents, referred to here as Document Intelligence (DI), is challenging due to their complex formats and structures, internal and external cross references deployed, quality of scans and OCR performed, and many domains of knowledge involved.

While a variety of research has advanced the fundamentals of document understanding, the majority have focused on documents found on the web which fail to capture the complexity of analysis and types of understanding needed across business documents. Realizing the vision of Document Intelligence remains a research challenge that requires a multi-disciplinary perspective spanning not only natural language processing and understanding, but also computer vision, layout understanding, knowledge representation and reasoning, data mining, knowledge discovery, information retrieval, and more – all of which have been profoundly impacted and advanced by deep learning in the last few years. This workshop aims to explore and advance the current state of research and practice, including but not limited to the following topics:

- Document modeling and representations.
- Document structure and layout learning and recognition.
- Cleansing and image enhancement techniques for scanned documents.
- Information extraction from text and semi-structured documents.
- Linguistic analysis of business documents.
- Natural language reasoning and inference.
- Question answering on business documents.
- Semantic understanding of business documents.
- Document search and clustering
- Handwritten recognition in business documents.
- Table identification and extraction from business documents.
- Chart learning and understanding.
- Domain-specific document understanding.
- Knowledge representation for business documents.
- Multilingual document understanding methods and frameworks.
- Integrated syntax and semantic approaches for document understanding.
- Transfer learning methods for business document reading and understanding.

In addition to the invited talks and the panel discussion on topics related to Document Intelligence, the workshop program will include paper sessions which provides an opportunity to present peer-reviewed work on the topic related to Document Intelligence.


We are soliciting submissions of short papers in PDF format and formatted according to the Standard ACM Conference Proceedings Template.

- Word authors: please use Interim layout.docx/interim sample pdf.
LaTeX authors: please download LATEX (Version 1.77) and use \documentclass[sigconf]{acmart}.
- Submissions are limited to 4 pages, not including references. Submissions that do not meet the formatting requirements will be rejected without review.

Submissions can be original research contributions, or abstracts of papers previously submitted to top-tier venues, but not currently under review in other venues and not yet published. The research contributions may discuss technical challenges of reading and interpreting business documents and present research results.

The review process is double-blind. The submitted contributions will be peer-reviewed by the Program Committee, and preference will be given to high-quality original and relevant work to the Document Intelligence topics.

It is expected that one of the authors of accepted contributions will register and attend the workshop to present the work in video in the workshop’s Paper Sessions (format to be decided). Accepted contributions will be made publicly available as non-archival reports, allowing future submissions to archival conferences or journals.

#Submission URL

Microsoft Research CMT:

#Important Dates

- Paper Submission Deadline: June 9, 2022 (anywhere on Earth).
- Paper Notification Date: June 20, 2022.
- Paper Final Version Due: TBD, 2022.
- Workshop Date: August 14, 2022 (Sunday).

#Workshop Website

#Contact Information:

Email: [email protected]

#Workshop Organizing Committee

- Douglas Burdick (IBM Research)
- Benjamin Han (Microsoft Azure AI)
- Dave Lewis (Redgrave Data)
- Sandeep Tata (Google Research)
- Dan Tecuci (EY)

#Program Committee Chair

Ani Nenkova (Adobe Research)

#Past Workshop

- DI-2021: Second Document Intelligence Workshop @ KDD 2021
- DI-2019: First Document Intelligence Workshop @ NeurIPS 2019
- DI-2019 Report: Hamid Motahari, Nigel Duffy, Paul Bennett, and Tania Bedrax-Weiss. A Report on the First Workshop on Document Intelligence (DI) at NeurIPS 2019. SIGKDD Explorations, Vol. 22, Issue 2. December 2020.


DI 2022 : DI 2022: Document Intelligence Workshop @ KDD 2022 will take place in Washington DC. It’s a 1 day event starting on Aug 14, 2022 (Sunday) and will be winded up on Aug 14, 2022 (Sunday).

DI 2022 falls under the following areas: DOCUMENT UNDERSTANDING, NATURAL LANGUAGE PROCESSING, COMPUTER VISION, INFORMATION EXTRACTION, etc. Submissions for this Workshop can be made by Jun 9, 2022. Authors can expect the result of submission by Jun 20, 2022.

Please check the official event website for possible changes before you make any travelling arrangements. Generally, events are strict with their deadlines. It is advisable to check the official website for all the deadlines.

Other Details of the DI 2022

  • Short Name: DI 2022
  • Full Name: DI 2022: Document Intelligence Workshop @ KDD 2022
  • Timing: 09:00 AM-06:00 PM (expected)
  • Fees: Check the official website of DI 2022
  • Event Type: Workshop
  • Website Link:
  • Location/Address: Washington DC

Credits and Sources

[1] DI 2022 : DI 2022: Document Intelligence Workshop @ KDD 2022

Check other Conferences, Workshops, Seminars, and Events


ESWC 2021: Extended Semantic Web Conference
Jun 6, 2021
AICCSA 2022: 19th ACS/IEEE International Conference on Computer Systems and Applications
Abu Dhabi, UAE
Dec 5, 2022
NLP4DH 2022: 2nd International Workshop on Natural Language Processing for Digital Humanities
Taipei, Taiwan
Nov 24, 2022
ALQAC 2022: Automated Legal Question Answering Competition
Nha Trang, Vietnam
Oct 19, 2022
iTextbooks 2022: iTextbooks 2022 : Fourth Workshop on Intelligent Textbooks at AIED 2022
Durham, UK
Jul 27, 2022


KST 2023: 2023 15th International Conference on Knowledge and Smart Technology (KST)
Novotel Vintage Park, Phuket, Thailand
Feb 21, 2023
WSCG 2023: 31. International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision 2023
May 15, 2023
GMLR @ ACM SAC 2023: ACM SAC Track on Graph Models for Learning and Recognition
Tallinn, Estonia
Mar 27, 2023
CVPR 2023: The IEEE/CVF Conference on Computer Vision and Pattern Recognition
Vancouver, Canada
Jun 18, 2023
Nov 26, 2022


TempWeb 2022: The 12th Temporal Web Analytics Workshop (TempWeb 2022)
Lyon, France (online)
Apr 25, 2022
WebNLG+ 2020: 3rd Workshop on Natural Language Generation from the Semantic Web
Dublin, Ireland (Remote)
Dec 18, 2020
SLIE 2021: Semantic, Logics, Information Extraction and AI
North-Miami Beach
May 16, 2021
SLIE 2020: Semantic, Logics, Information Extraction and AI
North Miami Beach
May 17, 2020
NoDaLiDa 2019: [FNP 2019] Second Financial Narrative Processing Workshop
Turku, Finland
Apr 15, 2019