Functions in Korppi Korppi


Log in!
Jun/30/2022 23:32

TIES671 COM2: Information Extraction & Automatic Knowledge-Base Construction (JSS25), 3 ECTS [home page]

Department of Computer Science and Information Systems, Department of Mathematical Information Technology

The language used was temporarily changed. Personal information page allows you to save your language settings.
You cannot register for the course because the course has expired.
The registration deadline for this course passed 30.4.15 at 23:59.
The course is full.

General information

Home page: http://www.jyu.fi/summerschool
Begins - ends: 10.8.15 - 14.8.15
Registration period: 1.2.15 at 0:00 - 30.4.15 at 23:59
Instructor(s): Michael Cochez (michaelcochez@gmail.com)
Credits: 3 ECTS cr.
Languages: language(s) of instruction: English; completion language(s): English
Registered: 36
Max participants: 35
Still room for: 0
Organisations:Department of Computer Science and Information Systems (TKO), Department of Mathematical Information Technology (MIT), Faculty of Information Technology (IT), Faculty of Mathematics and Science (SCI), Information technology (MIT) (TIE), Jyväskylä Summer School (JSS)
Current events:
  • Lecturer: Prof. Martin Theobald (University of Antwerp, Belgium)
  • The course is a part of the 25th international Jyväskylä Summer School programme.

The Web bears the potential of being the world’s greatest encyclopedic source, but we are still far from fully exploiting this potential. Valuable scientific and cultural content is interspersed with a huge amount of noisy, low-quality, unstructured text and media. The proliferation of knowledge-sharing communities like Wikipedia and recent advances in automated information-extraction technologies from otherwise unstructured Web documents give rise to an unprecedented opportunity: Can we systematically harvest facts from the Web and compile them into a comprehensive machine-readable knowledge base?

Such a knowledge base would contain not only the world’s entities, but also their semantic properties, and their relationships with each other.Imagine a “Structured Wikipedia” that has the same scale and richness as Wikipedia itself, but offers a precise and concise representation of knowledge, e.g., in the versatile RDF/S data format. This would enable expressive and highly precise querying, e.g., in the SPARQL language (or appropriate extensions), with additional capabilities for informative ranking of query results. The benefits from solving the above challenge would obviously be enormous. Applications include:

  1. a formalized machine-readable encyclopedia that can be queried with high precision like a semantic database;
  2. a key asset for disambiguating entities by supporting fast and accurate mappings of textual phrases onto named entities in the knowledge base; and an enabler for entity-relationship-oriented semantic search on the Web, for detecting entities and relations in Web pages and reasoning about them in expressive (probabilistic) logics;
  3. a backbone for natural-language question answering that would aid in dealing with entities and their relationships in answering who/where/when/ etc. questions;
  4. a catalyst for the acquisition of further knowledge and the largely automated maintenance and growth of the knowledge base.

Each of the above aspects will be captured in the form of a 3-hour lecture. Each lecture provides an overview of the theoretical foundations of the respective block and summarizes recent research results in the related fields. A block is complemented by a 2 hour interactive exercise session with a final programming task, during which students are able to practice their newly gained knowledge.


Basic knowledge in databases, basic programming skills (Java/C++), prior course in statistics and/or probability theory is a plus.

Modes of study:

Obligatory attendance at lectures. Practical task based on theory lectures, programming challenge with options for team work. 

Completion mode:

Grading scale of 0 to 5

Course workload:

20 hours (4 x 5 h) with interactive exercises. Five hour blocks would itself be divided in 3 hours of lectures and two hours of exercises. In addition optional demonstrations in the afternoons five times a week four hours per day.

[Limit information on study groups]

Lecture [group details and registration]

Lecture [group details and registration]; registered 32, maximum 35
reg.time: 1.2.2015 00:00 - 30.4.2015 23:59
 LocationWeekDayDateAtSupervisorFurther informationURITapahtuman tiedot
1Ag C23433Mo10.8.201509:00-13:00CochezTapahtuman tiedot
2Ag C23433Tu11.8.201509:00-13:00CochezTapahtuman tiedot
3Ag C23433Wed12.8.201509:00-13:00CochezTapahtuman tiedot
4Ag C23433Th13.8.201509:00-13:00CochezTapahtuman tiedot
5Ag C23433Fr14.8.201509:00-13:00CochezTapahtuman tiedot

Demonstration [group details and registration]

Demonstration [group details and registration]; maximum 35
reg.time: 1.2.2015 00:00 - 30.4.2015 23:59
 LocationWeekDayDateAtSupervisorFurther informationURITapahtuman tiedot
1Ag C331.3, mikroluokka33Mo10.8.201509:00-18:00CochezTapahtuman tiedot
2Ag C331.3, mikroluokka33Tu11.8.201509:00-18:00CochezTapahtuman tiedot
3Ag C331.3, mikroluokka33Wed12.8.201509:00-18:00CochezTapahtuman tiedot
4Ag C331.3, mikroluokka33Th13.8.201509:00-18:00CochezTapahtuman tiedot
5Ag C331.3, mikroluokka33Fr14.8.201509:00-18:00CochezTapahtuman tiedot