Projects funded by the NCN


Information on the principal investigator and host institution

Information of the project and the call

Keywords

Equipment

Delete all

Learning classifiers from imbalanced and evolving data

2013/11/B/ST6/00963

Keywords:

supervised learning of classifiers imbalanced data evolving data streams concept drift

Descriptors:

  • ST6_11: Machine learning, statistical data processing and applications using signal processing (e.g. speech, image, video)
  • ST6_7: Artificial intelligence, intelligent systems, multi-agent systems

Panel:

ST6 - Computer science and informatics: informatics and information systems, computer science, scientific computing, intelligent systems

Host institution :

Politechnika Poznańska, Wydział Informatyki

woj. wielkopolskie

Other projects carried out by the institution 

Principal investigator (from the host institution):

dr hab. Jerzy Stefanowski 

Number of co-investigators in the project: 6

Call: OPUS 6 - announced on 2013-09-16

Amount awarded: 497 770 PLN

Project start date (Y-m-d): 2014-07-16

Project end date (Y-m-d): 2017-07-15

Project duration:: 36 months (the same as in the proposal)

Project status: Project settled

Information in the final report

  • Publication in academic press/journals (15)
  • Articles in post-conference publications (15)
  • Book publications / chapters in book publications (9)
  1. SMOTE-IPF: Addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering.
    Authors:
    José A. Sáez, Julián Luengo, Jerzy Stefanowski, Francisco Herrera
    Academic press:
    Information Sciences (rok: 2015, tom: 291, strony: 184-203), Wydawca: Elsevier
    Status:
    Published
    DOI:
    10.1016/j.ins.2014.08.051 - link to the publication
  2. Can interestingness measures be usefully visualized?
    Authors:
    Robert Susmaga,Izabela Szczęch
    Academic press:
    International Journal of Applied Mathematics and Computer Science (rok: 2015, tom: 25 (2), strony: 323–336), Wydawca: bd.
    Status:
    Published
    DOI:
    10.1515/amcs-2015-0025 - link to the publication
  3. Measures of rule interestingness in various perspectives of confirmation
    Authors:
    Salvatore Greco, Roman Słowiński, Izabela Szczęch
    Academic press:
    Information Sciences (rok: 2016, tom: 346-347, strony: 216-235), Wydawca: Elsevier
    Status:
    Published
    DOI:
    10.1016/j.ins.2016.01.056 - link to the publication
  4. Types of Minority Class Examples and Their Influence on Learning Classifiers from Imbalanced Data
    Authors:
    Krystyna Napierała, Jerzy Stefanowski
    Academic press:
    Journal of Intelligent Information Systems (rok: 2016, tom: 46 (3), strony: 563-597), Wydawca: Springer
    Status:
    Published
    DOI:
    10.1007/s10844-015-0368-1 - link to the publication
  5. Addressing imbalanced data with argument based rule learning
    Authors:
    Krystyna Napierała, Jerzy Stefanowski
    Academic press:
    Expert Systems with Applications (rok: 2015, tom: 42 (24), strony: 9468-9481), Wydawca: Elsevier
    Status:
    Published
    DOI:
    10.1016/j.eswa.2015.07.076 - link to the publication
  6. Open Challenges for Data Stream Mining Research
    Authors:
    Georg Krempl, Indre Zliobaite Dariusz Brzezinski, Eyke Hullermeier, Mark Last, Vincent Lemaire, Tino Noack, Ammar Shaker, Sonja Sievi, Myra Spiliopoulou, Jerzy Stefanowski
    Academic press:
    SIGKDD Explorations (rok: 2014, tom: 16(1), strony: 45301), Wydawca: ACM
    Status:
    Published
  7. Post-processing of BRACID Rules Induced from Imbalanced Data
    Authors:
    Krystyna Napierała, Jerzy Stefanowski
    Academic press:
    Fundamenta Informaticae (rok: 2016, tom: 148 (1-2), strony: 51-64), Wydawca: IOS Press
    Status:
    Published
    DOI:
    10.3233/FI-2016-1300 - link to the publication
  8. Abstaining in Rule Set Bagging for Imbalanced Data
    Authors:
    Krystyna Napierała, Jerzy Stefanowski
    Academic press:
    Logic Journal of the IGPL (rok: 2015, tom: 23 (3), strony: 421-430), Wydawca: Oxford University Press
    Status:
    Published
    DOI:
    10.1093/jigpal/jzv006 - link to the publication
  9. Neighbourhood Sampling in Bagging for Imbalanced Data
    Authors:
    Jerzy Błaszczyński, Jerzy Stefanowski
    Academic press:
    Neurocomputing (rok: 2015, tom: 150 (B), strony: 529-542), Wydawca: Elsevier
    Status:
    Published
    DOI:
    10.1016/j.neucom.2014.07.064 - link to the publication
  10. Difficulty Factors and Preprocessing in Imbalanced Data Sets: An Experimental Study on Artificial Data
    Authors:
    Szymon Wojciechowski, Szymon Wilk
    Academic press:
    Foundations of Computing and Decision Sciences (rok: 2017, tom: 42 (2), strony: 149-176), Wydawca: De Gruyter (elektroniczny open access) i Wydawnictwo PP (papierowa wersja)
    Status:
    Published
    DOI:
    10.1515/fcds-2017-0007 - link to the publication
  11. Ensemble learning for data stream analysis: a survey
    Authors:
    Bartosz Krawczyk, Leandro L. Minku, Joao Gama, Jerzy Stefanowski, Michał Woźniak
    Academic press:
    Information Fusion (rok: 2017, tom: 37, strony: 132-156), Wydawca: Elsevier
    Status:
    Published
    DOI:
    10.1016/j.inffus.2017.02.004 - link to the publication
  12. Learning Ensemble Classifiers for Diabetic Retinopathy Assessment
    Authors:
    Emran Saleh, Jerzy Blaszczynski, Antonio Moreno, Aida Valls, Pedro Romero-Aroca, Sofia de la Riva-Fernandez, Roman Slowinski
    Academic press:
    Artificial Intelligence in Medicine (rok: 2018, tom: 85, strony: 50-63), Wydawca: Elsevier
    Status:
    Published
    DOI:
    10.1016/j.artmed.2017.09.006 - link to the publication
  13. Multi-class and feature selection extensions of Roughly Balanced Bagging for imbalanced data
    Authors:
    Mateusz Lango, Jerzy Stefanowski
    Academic press:
    Journal of Intelligent Information Systems (rok: 2018, tom: 50(1), strony: 97-127), Wydawca: Springer
    Status:
    Published
    DOI:
    10.1007/s10844-017-0446-7 - link to the publication
  14. Prequential AUC: Properties of the Area Under the ROC Curve for Data Streams with Concept Drift
    Authors:
    Dariusz Brzezinski, Jerzy Stefanowski
    Academic press:
    Knowledge and Information Systems (rok: 2017, tom: 52(2), strony: 531-562), Wydawca: Springer
    Status:
    Published
    DOI:
    10.1007/s10115-017-1022-8 - link to the publication
  15. Visualization support for the analysis of properties of interestingness measures
    Authors:
    Robert Susmaga, Izabela Szczęch
    Academic press:
    Bulletin of the Polish Academy of Sciences Technical Sciences (rok: 2015, tom: 63 (1), strony: 315--327), Wydawca: PAN
    Status:
    Published
    DOI:
    10.1515/bpasts-2015-0036 - link to the publication
  1. Applicability of Roughly Balanced Bagging for Complex Imbalanced Data.
    Authors:
    Mateusz Lango, Jerzy Stefanowski
    Conference:
    Proc. of 4th Int. Workshop NFMCP in conjunction with ECML-PKDD 2015 Conference, Porto (rok: 2015, ), Wydawca: brak
    Data:
    konferencja 7-11 wrzesnia
    Status:
    Published
  2. Application of preprocessing methods to imbalanced clinical data: an experimental study
    Authors:
    Szymon Wilk, Jerzy Stefanowski, Szymon Wojciechowski, Ken J. Farion, Wojciech Michalowski
    Conference:
    Information Technologies in Biomedicine. 5th International Conference, ITIB 2016 (rok: 2016, ), Wydawca: Advances in Intelligent Systems and Computing 471, Springer
    Data:
    konferencja 20-22 czerwca
    Status:
    Published
  3. Evaluating Difficulty of Multi-class Imbalanced Data
    Authors:
    Mateusz Lango, Krystyna Napierała, Jerzy Stefanowski
    Conference:
    Proc. Foundations of Intelligent Systems - 23rd International Symposium, ISMIS 2017 (rok: 2017, ), Wydawca: Springer LNCS vol. 10352
    Data:
    konferencja 26-29 czerwca
    Status:
    Published
  4. Actively Balanced Bagging for Imbalanced Data
    Authors:
    Jerzy Błaszczyński, Jerzy Stefanowski
    Conference:
    Proc. Foundations of Intelligent Systems - 23rd International Symposium, ISMIS 2017 (rok: 2017, ), Wydawca: Springer LNCS vol. 10352
    Data:
    konferencja 26-29 czerwca
    Status:
    Published
  5. Ensemble Diversity in Evolving Data Streams
    Authors:
    Dariusz Brzezinski, Jerzy Stefanowski
    Conference:
    Proceedings of 19th International Conference, Discovery Science: DS 2016 (rok: 2016, ), Wydawca: Springer: Lecture Notes in Computer Science vol. 9956
    Data:
    konferencja 19-21 października
    Status:
    Published
  6. Fusion of Clinical Data: A Case Study to Predict the Type of Treatment of Bone Fractures
    Authors:
    Anam Haq, Szymon Wilk
    Conference:
    New Trends in Databases and Information Systems -ADBIS2017. Short Papers and Workshops: BigNovelTI, DaS, SW4CH, AMSD (rok: 2017, ), Wydawca: Springer, Series Communications in Computer and Information Science, vol 767.
    Data:
    konferencja 24-27 września
    Status:
    Published
  7. An Algorithm for Selective Preprocessing of Multi-class Imbalanced Data
    Authors:
    Szymon Wojciechowski, Szymon Wilk, Jerzy Stefanowski
    Conference:
    10th International Conference on Computer Recognition Systems CORES 2017 (rok: 2017, ), Wydawca: Springer
    Data:
    konferencja 22-24 maja
    Status:
    Published
  8. Diversity Analysis on Imbalanced Data Using Neighbourhood and Roughly Balanced Bagging Ensembles
    Authors:
    Jerzy Błaszczynski, Mateusz Lango
    Conference:
    Proceeding of the Int. Conference ICAISC 2016 (rok: 2016, ), Wydawca: Springer LNAI vol. 9692
    Data:
    konferencja 12-16 czerwca
    Status:
    Published
  9. A Data- and Expert-driven Decision Support Framework for Helping Patients Adhere to Therapy: Psychobehavioral Targets and Associated Interventions
    Authors:
    Szymon Wilk, Dympna O'Sullivan, Martin Michalowski, Silvia Bonaccio, Wojtek Michalowski, Mor Peleg, Marc Carrier
    Conference:
    Proceedings of the International Joint Workshop on Knowledge Representation for Health Care, Process-Oriented Information Systems in Health Care, Extraction and Processing of Rich Semantics from Medical Texts (KR4HC-ProHealth-RichMedSem 2017) (rok: 2017, ), Wydawca: Materialy towarzyszace międzynarod. konf. AIME
    Data:
    konferencja 24 czerwca
    Status:
    Published
  10. On Properties of Under-sampling Bagging and its Extensions for Imbalanced Data
    Authors:
    Jerzy Stefanowski
    Conference:
    the 9th International Conference on Computer Recognition Systems CORES 2015, (rok: 2015, ), Wydawca: Springer Advances in Intelligent Systems and Computing - Postproceedings ukaze sie w 2016
    Data:
    konferencja 25-27 maja
    Status:
    Published
  11. Prequential AUC for Classifier Evaluation and Drift Detection in Evolving Data Streams
    Authors:
    Dariusz Brzeziński, Jerzy Stefanowski
    Conference:
    Electronic proceedings of 3rd ECML PKDD International Workshop on New Frontiers in Mining Complex Patterns, Nancy, Francja (rok: 2014, ), Wydawca: bd
    Data:
    konferencja 15-19 wrzesnia 2014
    Status:
    Published
  12. Tetrahedron: Barycentric Measure Visualizer
    Authors:
    Dariusz Brzezinski, Jerzy Stefanowski, Robert Susmaga, Izabela Szczęch
    Conference:
    Machine Learning and Knowledge Discovery in Databases, European Conference, ECML PKDD 2017 Skopje (rok: 2017, ), Wydawca: Springer, Proceedings ECML PKDD 2017, part III, Springer LNAI vol. 10536,
    Data:
    konferencja 18–22 września
    Status:
    Published
  13. Discovering Minority Sub-clusters and Local Difficulty Factors from Imbalanced Data
    Authors:
    Mateusz Lango, Dariusz Brzezinski, Sebastian Firlik, Jerzy Stefanowski
    Conference:
    20th International Conference Discovery Science, DS 2017 (rok: 2017, ), Wydawca: Springer, LNCS vol. 10558
    Data:
    konferencja 15-17 października
    Status:
    Published
  14. Impact of Local Data Characteristics on Learning Rules from Imbalanced Data
    Authors:
    Jerzy Stefanowski
    Conference:
    Electronic Proceedings of 6th Rough Set Theory Workshop (RST'2015) in conjunction with PReMI 2015 conference (rok: 2015, ), Wydawca: brak
    Data:
    konferencja 29 czerwca 2015
    Status:
    Published
  15. PUT at SemEval-2016 Task 4: The {ABC} of Twitter Sentiment Analysis.
    Authors:
    Mateusz Lango, Dariusz Brzezinski, Jerzy Stefanowski
    Conference:
    Proceedings of the 10th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT 2016 (rok: 2016, ), Wydawca: Association for Computational Linguistics
    Data:
    konferencja 16-17 czerwca
    Status:
    Published
  1. Adaptive Ensembles for Evolving Data Streams--Combining Block-Based and Online Solutions
    Authors:
    Jerzy Stefanowski
    Book:
    M. Ceci et al. (eds): New Frontiers in Mining Complex Patterns. Revised Selected Papers from 4th International Workshop, NFMCP 2015, Held in Conjunction with ECML-PKDD 2015 (rok: 2016, tom: LNAI 9607, strony: 3–16), Wydawca: Springer
    Status:
    Published
  2. Increasing the Interpretability of Rules Induced from Imbalanced Data by Using Bayesian Confirmation Measures
    Authors:
    Krystyna Napierała, Jerzy Stefanowski, Izabela Szczęch
    Book:
    Annalisa Appice et al. (eds) New Frontiers in Mining Complex Patterns. Revised selected papers from the Fifth International Workshop, NFMCP 2016, Held in Conjunction with ECML-PKDD 2016 (rok: 2017, tom: LNAI 10312, strony: 84-98), Wydawca: Springer
    Status:
    Published
  3. Local Data Characteristics in Learning Classifiers from Imbalanced Data
    Authors:
    Jerzy Błaszczyński, Jerzy Stefanowski
    Book:
    Adam Gaweda , Janusz Kacprzyk, Leszek Rutkowski i Gary G. Yen (red.): Advances in Data Analysis with Computational Intelligence Methods. Studies in Computational Intelligence Series. (rok: 2018, tom: vol. 738, strony: 51-85), Wydawca: Springer
    Status:
    Published
  4. Final Remarks on Big Data Analysis and Its Impact on Society and Science
    Authors:
    Jerzy Stefanowski, Nathalie Japkowicz
    Book:
    Big Data Analysis: New Algorithms for a New Society (rok: 2016, tom: St Big Data 16, strony: 305-329), Wydawca: Springer
    Status:
    Published
  5. Bayesian Confirmation Measures in Rule-based Classification
    Authors:
    Dariusz Brzezinski, Zbigniew Grudziński, Izabela Szczęch
    Book:
    Annalisa Appice et al. (eds) New Frontiers in Mining Complex Patterns. Revised selected papers from the Fifth International Workshop, NFMCP 2016, Held in Conjunction with ECML-PKDD 2016 (rok: 2017, tom: LNAI 10312, strony: 39-53), Wydawca: Springer
    Status:
    Published
  6. Dealing with Data Difficulty Factors while Learning from Imbalanced Data
    Authors:
    Jerzy Stefanowski
    Book:
    Challenges in Statistics and Data Mining (rok: 2016, tom: 605, strony: 333-363), Wydawca: Studies in Computational Intelligence Series, Springer
    Status:
    Published
  7. The Usefulness of Roughly Balanced Bagging for Complex and High-dimensional Imbalanced Data
    Authors:
    Mateusz Lango, Jerzy Stefanowski
    Book:
    M. Ceci et al. (eds): New Frontiers in Mining Complex Patterns. Revised Selected Papers from 4th International Workshop, NFMCP 2015, Held in Conjunction with ECML-PKDD 2015 (rok: 2016, tom: LNAI 9607, strony: 93-107), Wydawca: Springer
    Status:
    Published
  8. Ensemble Classifiers for Imbalanced and Evolving Data Streams
    Authors:
    Dariusz Brzeziński, Jerzy Stefanowski
    Book:
    Mark Last, Abraham Kandel and Horst Bunke (eds): Data Mining in Time Series and Streaming Databases; (World Scientific) Series in Machine Perception and Artificial Intelligence (rok: 2018, tom: b.d., strony: 44-68), Wydawca: World Scientific
    Status:
    Published
  9. Prequential AUC for Classifier Evaluation and Drift Detection in Evolving Data Streams
    Authors:
    Dariusz Brzeziński, Jerzy Stefanowski
    Book:
    Annalisa Appice et al. (eds) New Frontiers in Mining Complex Patterns. (rok: 2015, tom: 8983, strony: 87–101), Wydawca: Springer , Lecture Notes in Computer Science
    Status:
    Published