Projects funded by the NCN


Information on the principal investigator and host institution

Information of the project and the call

Keywords

Equipment

Delete all

Classification based on high-dimensional open-set data - with applications in Text Mining

2016/21/B/ST6/02159

Keywords:

open-set classification high-dimensional data classification and clustering of text document outlier detection in high-dimensional data machine learning

Descriptors:

  • ST6_11: Machine learning, statistical data processing and applications using signal processing (e.g. speech, image, video)
  • ST6_9: Human computer interaction, speech recognition and synthesis, natural language processing

Panel:

ST6 - Computer science and informatics: informatics and information systems, computer science, scientific computing, intelligent systems

Host institution :

Politechnika Wrocławska, Wydział Elektroniki

woj. dolnośląskie

Other projects carried out by the institution 

Principal investigator (from the host institution):

dr hab. Henryk Maciejewski 

Number of co-investigators in the project: 3

Call: OPUS 11 - announced on 2016-03-15

Amount awarded: 269 550 PLN

Project start date (Y-m-d): 2017-02-01

Project end date (Y-m-d): 2019-09-30

Project duration:: 32 months (the same as in the proposal)

Project status: Project settled

Project description

Download the project description in a pdf file

Note - project descriptions were prepared by the authors of the applications themselves and placed in the system in an unchanged form.

Equipment purchased [PL]

  1. Komputer - mobilna stacja robocza (2 szt.) (12 000 PLN)
  2. Komputer stacjonarny, z dyskiem w trybie RAID (6 000 PLN)

Information in the final report

  • Publication in academic press/journals (1)
  • Articles in post-conference publications (8)
  1. Algorithm Based on Modified Angle-Based Outlier Factor for Open-Set Classification of Text Documents
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Academic press:
    Applied Stochastic Models in Business and Industry (rok: 2018, tom: Special Issue, strony: 45303), Wydawca: Wiley
    Status:
    Published
    DOI:
    10.1002/asmb.2388 - link to the publication
  1. Feature Extraction in Subject Classification of Text Documents in Polish
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Conference:
    International Conference on Artificial Intelligence and Soft Computing (rok: 2018, ), Wydawca: Springer International Publishing AG
    Data:
    konferencja 16-20.06.2018
    Status:
    Published
  2. Open Set Subject Classification of Text Documents in Polish by Doc-to-Vec and Local Outlier Factor
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Conference:
    International Conference on Artificial Intelligence and Soft Computing (rok: 2019, ), Wydawca: Springer Nature Switzerland AG
    Data:
    konferencja 16-20.06.2019
    Status:
    Published
  3. Reduction of dimensionality of feature vectors in subject classification of text documents
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Conference:
    RelStat 2018 (rok: 2018, ), Wydawca: Springer Nature Switzerland AG
    Data:
    konferencja 17-20.10.2018
    Status:
    Published
  4. Distance metrics in Open-Set Classification of Text Documents by Local Outlier Factor and Doc2Vec
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Conference:
    IEA / AIE 2019 (rok: 2019, ), Wydawca: Springer Nature Switzerland AG
    Data:
    konferencja 9-11.07.2019
    Status:
    Published
  5. Low-dimensional classification of text documents
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Conference:
    DEPCOS 2019 (rok: 2019, ), Wydawca: Springer Nature Switzerland AG
    Data:
    konferencja 1-5.07.2019
    Status:
    Published
  6. Utilizing Local Outlier Factor for Open-Set Classification in High-Dimensional Data - Case Study Applied for Text Documents
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Conference:
    IntelliSys 2019 (rok: 2019, ), Wydawca: Springer Nature Switzerland AG
    Data:
    konferencja 5-6.09.2019
    Status:
    Published
  7. Bag-of-Words, Bag-of-Topics and Word-to-Vec Based Subject Classification of Text Documents in Polish - A Comparative Study
    Authors:
    Tomasz Walkowiak, Szymon Datko, Henryk Maciejewski
    Conference:
    DEPCOS 2018 (rok: 2018, ), Wydawca: Springer International Publishing AG
    Data:
    konferencja 2-4.07.2018
    Status:
    Published
  8. On Validity of Extreme Value Theory-Based Parametric Models for Out-of-Distribution Detection
    Authors:
    Tomasz Walkowiak, Kamil Szyc, Henryk Maciejewski
    Conference:
    ICCS 2021 (rok: 2021, ), Wydawca: Springer
    Data:
    konferencja 6,2021
    Status:
    Accepted for publication