Fraunhofer INT


Linguistic structures can be used to uncover latent information within given data. Furthermore, linguistics helps to create the specific text corpora needed to solve problems by text-analytic means.

Problem: Manual linguistic analysis and manual creation of text corpora are both time-consuming and costly.

Solution: New linguistic approaches enable the (semi-)automated analysis of linguistic structures in texts as well as the (semi-)automated creation of text corpora.

Case Study

Approaches and case studies are published in:

Moohebat, M., Raj, R.G., Thorleuchter, D., Kareem, S.B.A.: Identifying ISI-indexed articles by their lexical usage: A text analysis approach. Journal of the Association for Information Science and Technology 66(3), 2015, 501-511.

Saloot, M.A., Idris, N., Aw, A.T., Thorleuchter, D.: Twitter Corpus Creation: The Case of a Malay Chat-style-text Corpus (MCC). Digital Scholarship in the Humanities 31(2), 2016, 227-243.

Moohebat, M., Thorleuchter, D., Raj, R.G., Kareem, S.B.A.: Linguistic Feature Classifying and Tracing. Malaysian Journal of Computer Science, in press.

Saloot, M.A., Idris, N., Mahmud, R., Jaafar, S., Thorleuchter, D., Gani, A.: Hadith Data Mining and Classification: A Comparative Analysis. Artificial Intelligence Review 46(1), 2016, 113-128.

Thorleuchter, D., Van den Poel, D.: Using Text Summarizing to support Planning of Research and Development. Advances in Intelligent Systems and Computing, 275, 2014, 23-29, Springer, Berlin.

