Hello there! My name is Mike Zhang. I'm a third-year PhD student at the IT University of Copenhagen (ITU) under the supervision of Prof. Barbara Plank. I am part of the NLPnorth research unit and also affiliated with the MaiNLP Lab at CIS (Center for Information and Language Processing) at the Ludwig Maximilian University of Munich (LMU).
In Spring 2023, I will also be a Research Intern at WING (Web Information Retrieval & Natural Language Processing Group) at the National University of Singapore, advised by Prof. Min-Yen Kan. I will work on NLP and IR for job descriptions and similar text sources.
My main focus is automated, high-quality Information Extraction from unstructured text, with real-life use cases that have societal impact. Specifically, I work on Skill Extraction for Job Market Analysis. I am also interested in methods for obtaining more labeled training data and for making the most of models on tasks with limited data, including Active Learning, Weak Supervision, and Transfer Learning.
- 01/09/2022: Started as a Research Intern at NEC Laboratories Europe.
- 28/08/2022: Paper accepted at RecSysHR 2022.
- 12/07/2022: Presented our paper at NAACL 2022.
- 21/06/2022: Presented our paper at LREC 2022.
- 29/04/2022: We received an Outstanding Paper Award at the Machine Learning Evaluation Standards Workshop at ICLR 2022!
- 08/04/2022: Paper accepted at NAACL 2022.
- 04/04/2022: Paper accepted at LREC 2022.
- 22/10/2021: Gave a talk about our work on Active Learning at the GroNLP group.
- 27/08/2021: Our work on Active Learning was accepted at EMNLP 2021.
- 02/06/2021: Accepted to LxMLS 2021.
- 02/06/2021: Presented our paper at NoDaLiDa 2021 [video].
- 22/03/2021: First paper accepted at NoDaLiDa 2021!
- 15/09/2020: Started in Barbara Plank's NLP lab as a PhD Student, part of NLPnorth.
- 02/08/2019: Presented our paper at WMT in Florence during ACL 2019.
 Mike Zhang, Kristian Nørgaard Jensen, Rob van der Goot, and Barbara Plank. 2022. Skill Extraction from Job Postings using Weak Supervision. In Workshop on Recommender Systems for Human Resources 2022 (RecSysHR2022), in conjunction with the 16th ACM Conference on Recommender Systems 2022 (RecSys). [Paper] [Code]
 Dennis Ulmer, Elisa Bassignana, Max Müller-Eberstein, Daniel Varab, Mike Zhang, Christian Hardmeier, and Barbara Plank. 2022. Experimental Standards for Deep Learning Research: A Natural Language Processing Perspective. In the Machine Learning Evaluation Standards Workshop at ICLR 2022 (SMILES). Outstanding Paper Award. [Paper] [Repository]
 Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, and Barbara Plank. 2022. SkillSpan: Hard and Soft Skill Extraction from Job Postings. In Proceedings of the 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). [Paper] [Code]
 Mike Zhang, Kristian Nørgaard Jensen, and Barbara Plank. 2022. Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning. In Proceedings of the 13th Edition of the Language Resources and Evaluation Conference (LREC). [Paper] [Code]
 Kristian Nørgaard Jensen, Mike Zhang and Barbara Plank. 2021. De-identification of Privacy-related Entities in Job Postings. In Proceedings of the 23rd Nordic Conference of Computational Linguistics (NoDaLiDa). [Paper] [Slides] [Code] [Video]
 Mike Zhang and Antonio Toral. 2019. The Effect of Translationese in Machine Translation Test Sets. In Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers). [Paper] [Slides] [Code]
 Mike Zhang, Roy David, Leon Graumans and Gerben Timmerman. 2019. Grunn2019 at SemEval-2019 Task 5: Shared Task on Multilingual Detection of Hate. In Proceedings of the 13th International Workshop on Semantic Evaluation. [Paper] [Code]
IT University of Copenhagen
Spring 2021, 2022
BSSEYEP1KU, Introduction to NLP and Deep Learning (Senior TA, Lecturer)
Master Thesis, Computer Science. (Supervision)
Master Research Project, Computer Science. (Supervision)
PhD Course, Communicating State-of-the-art NLP Research to a Broader Audience (Co-Organizer)
University of Groningen
SOMINDW07, Machine Learning (Head TA)
SOMINDW07, Machine Learning (Head TA)
LIX016M05, Learning from Data (Head TA)
LIX017B05, Social Media (TA)
PhD Computer Science, IT University of Copenhagen
MA Information Science, University of Groningen
BSc Information Science, University of Groningen
Other (Professional) Experiences
01/2020 - 08/2020
Data Engineer, Dataprovider.com B.V.
09/2019 - 12/2019
Research Engineer Intern, Dataprovider.com B.V.
Chairing: LREC (Co-chair, 2022)
Reviewer: ACL (2019-), EMNLP (2021-), ARR (2021-), CoNLL (2021-), NAACL SRW (2022), W-NUT (2021), RecSysHR (2022)
You can reach me at mikz(at)itu(dot)dk or message me on any other platform here.