Projects funded by the NCN


Information on the principal investigator and host institution

Information of the project and the call

Keywords

Equipment

Delete all

Automatic detection and correction of annotation errors in Polish language corpora

2011/01/N/ST6/01107

Keywords:

language corpus corpus annotation error detection anomaly detection machine learning

Descriptors:

  • ST6_8: Computer graphics, image processing, computer vision, multimedia, computer games
  • ST6_6: Algorithms, parallel, distributed and network algorithms, algorithmic game theory

Panel:

ST6 - Computer science and informatics: informatics and information systems, computer science, scientific computing, intelligent systems

Host institution :

Instytut Podstaw Informatyki PAN

woj. mazowieckie

Other projects carried out by the institution 

Principal investigator (from the host institution):

dr Łukasz Kobyliński 

Number of co-investigators in the project: 4

Call: PRELUDIUM 1 - announced on 2011-03-15

Amount awarded: 224 550 PLN

Project start date (Y-m-d): 2011-12-21

Project end date (Y-m-d): 2014-06-20

Project duration:: 30 months (the same as in the proposal)

Project status: Project settled

Equipment purchased [PL]

  1. Dysk zewnętrzny (500 PLN)
  2. Notebook (5 000 PLN)

Information in the final report

  • Articles in post-conference publications (3)
  1. Improving the accuracy of Polish POS tagging by using voting ensembles
    Authors:
    Łukasz Kobyliński
    Conference:
    6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics (rok: 2013, ), Wydawca: Wydawnictwo Poznańskie, Fundacja Uniwersytetu im. Adama Mickiewicza
    Data:
    konferencja 7-9 grudnia 2013
    Status:
    Published
  2. Automatic Detection of Annotation Errors in Polish-Language Corpora
    Authors:
    Łukasz Kobyliński
    Conference:
    International Conference Language Processing and Intelligent Information Systems (rok: 2013, ), Wydawca: Springer-Verlag
    Data:
    konferencja 17-18 czerwca 2013
    Status:
    Published
  3. PoliTa: A multitagger for Polish
    Authors:
    Łukasz Kobyliński
    Conference:
    Ninth International Conference on Language Resources and Evaluation, LREC 2014 (rok: 2014, ), Wydawca: ELRA
    Data:
    konferencja 26-31 maja 2014
    Status:
    Published