NoDaLiDa 2019 : Second Call for Participation - FinTOC Shared-task @FNP2019 @NoDaLiDa2019

Turku Finland
Event Date: April 15, 2019 - July 13, 2019
Submission Deadline: July 13, 2019




Call for Papers

Second call for participation - FinTOC shared task
⇒ The Second Financial Narrative Processing Workshop (FNP 2019)
⇒ The 22nd Nordic Conference on Computational Linguistics (NoDaLiDa’19 Turku Finland)

***
The best performing methods will have their papers published in the FNP2019 workshop!
***

Task: Predict a Table of Contents (ToC) from financial documents.
Two sub-tasks are proposed:
- Detection of titles
- Prediction of a ToC

Shared task webpage: http://wp.lancs.ac.uk/cfie/shared-task/
Shared task contact: [email protected]
Organizers:
- Najah-Imane BENTABET
- Sira Ferradans
- Remi Juge

Important dates
Registration deadline: June 29, 2019
Submission deadline: July 13, 2019
Workshop day: September 30, 2019


More reading
“Financial Document Structure Extraction”
Introduction:
A vast number of financial documents are created and published constantly in machine-readable formats (generally PDF), with only minimal structural information. Firms use such documents, typically corporate annual reports containing detailed financial and operational information, to report their activities, financial situation or potential investment plans to shareholders, investors and the financial markets.
In some countries, such as the US or France, regulators such as the SEC (via EDGAR) or the AMF require firms to follow a certain template when reporting their financial results, to ensure standardisation and consistency across firms’ disclosures. In other European countries, on the other hand, management usually has more discretion over what, where and how to report, resulting in a lack of standardisation between financial documents published within the same market.

In this shared task, we focus on analysing Financial Prospectuses: official PDF documents in which investment funds precisely describe their characteristics and investment modalities. Although the content they must include is often regulated, their format is not standardized and displays a great deal of variability, ranging from plain text to more graphical and tabular presentation of data and information. The majority of prospectuses are published without a table of contents (TOC), which is usually needed to help readers navigate the document by following a simple outline of headers and page numbers, and to assist professional teams in checking that all the required contents are included. Thus, automatic analysis of prospectuses to extract their structure is becoming increasingly vital to many firms across the world.

Task:
As part of the Financial Narrative Processing Workshop, we present a shared task on Financial Document Structure Extraction.
Systems participating in this shared task will be given a sample collection of financial prospectuses with different levels of structure and different lengths (document sizes), which are to be automatically analyzed to extract structural information and build a table of contents.
The task contains two subtasks:
a) Title detection
This is a binary classification task aiming at detecting titles in financial prospectuses. Given a set of text blocks, the goal is to classify each given text block as a ‘title’ or ‘non-title’. Titles can have different layouts and they have to be distinguished from the regular text.
b) TOC structure extraction
The TOC is a hierarchical organisation of the headers of a document. In this subtask, we provide only the headers of a prospectus, and the goal is to (i) identify the hierarchical level of each header and (ii) organize the headers of the document according to this hierarchical structure. Note that two headers with the same layout and the same text can have different hierarchical levels depending on their location in the document.
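To make the two subtasks concrete, here is a minimal Python sketch. It is not the organizers' baseline or data format: the TextBlock fields (font_size, bold), the thresholds in is_title, and the (level, title) input to build_toc are all assumptions made for illustration. The rule in is_title is a toy heuristic standing in for a trained classifier, and build_toc only covers step (ii) of subtask b, assuming the level of each header has already been predicted.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class TextBlock:
    # Hypothetical layout features; the task's real XML attributes may differ.
    text: str
    font_size: float
    bold: bool


@dataclass
class TocNode:
    title: str
    level: int
    children: List["TocNode"] = field(default_factory=list)


def is_title(block: TextBlock, body_font_size: float = 10.0) -> bool:
    """Toy rule for subtask (a): short text that is bold or larger than body text."""
    text = block.text.strip()
    # Empty, very long, or sentence-like blocks are treated as regular text.
    if not text or len(text) > 120 or text.endswith("."):
        return False
    return block.bold or block.font_size > body_font_size


def build_toc(headers: List[Tuple[int, str]]) -> List[TocNode]:
    """Nest (level, title) pairs, given in reading order, into a tree."""
    roots: List[TocNode] = []
    stack: List[TocNode] = []  # path from a root to the most recent header
    for level, title in headers:
        node = TocNode(title, level)
        # Pop until the top of the stack is a strictly shallower header.
        while stack and stack[-1].level >= level:
            stack.pop()
        if stack:
            stack[-1].children.append(node)
        else:
            roots.append(node)
        stack.append(node)
    return roots


# Usage with made-up blocks and headers.
blocks = [
    TextBlock("1. Investment Objective", 14.0, True),
    TextBlock("The fund aims to achieve long-term capital growth.", 10.0, False),
    TextBlock("1.1 Risk Profile", 12.0, False),
]
labels = ["title" if is_title(b) else "non-title" for b in blocks]
toc = build_toc([(1, "1. Investment Objective"), (2, "1.1 Risk Profile"), (1, "2. Fees")])
```

Because a header's level can depend on its position in the document, a realistic system would predict levels from context rather than from layout alone; the stack in build_toc only turns those predicted levels into a nested outline.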
Participants need to register. Once registered, all participating teams will be provided with a common training dataset, which includes common pre-processed input and corrected output. A common development set will also be provided. A blind test data set will be used to evaluate the output of the participating teams. An evaluation script will be provided to all the teams. In addition to the PDF version of the documents, we will provide their XML representation.


Background:
Existing work on book and document table of contents (TOC) recognition has almost all been carried out on small, application-dependent, and domain-specific datasets. However, the TOCs of documents from different domains differ significantly in their visual layout and style, making TOC recognition a challenging problem for a large-scale collection of heterogeneous documents and books. Compared to regular books (mostly provided in a full-text format with limited structural information such as pages and paragraphs), financial documents, which contain textual and non-textual content, have a more sophisticated structure including parts, sections, sub-sections and sub-sub-sections.

Important Dates:
(suggested plan FNP FinTOC task at NoDaLiDa 2019)

March 25, 2019: first announcement of shared task
April 10, 2019: setup of shared task website
April 15, 2019: registration begins; release of initial training sets and scoring script
May 18, 2019: final training data release
June 29, 2019: registration deadline
July 6, 2019: test set available
July 13, 2019: systems’ outputs collected
July 20, 2019: system results due to participants
July 27, 2019: shared task system papers due
August 10, 2019: reviews due
August 17, 2019: notification of acceptance
August 24, 2019: camera-ready version of shared task system papers due
September 30, 2019: workshop day
Shared Task Contact:
Questions about FinTOC-2019 shared task can be sent to:
[email protected]


