Mike Zhang

Research Intern @NEC Laboratories Europe

& PhD Fellow in NLP @NLPnorth, IT University of Copenhagen

& a Research Visitor @MaiNLP (CIS), Ludwig Maximilian University of Munich

Jump to: [ News | Publications | Teaching | Resume | Contact ]


Hello there! My name is Mike Zhang. I'm a third-year PhD Student at the IT University of Copenhagen (ITU) under supervision of Prof. Barbara Plank. I am part of the NLPnorth research unit and also affiliated with the MaiNLP Lab at CIS (Center for Information and Language Processing) at the Ludwig Maximilian University of Munich (LMU).

In the Fall semester, I will be a research intern at NEC Laboratories Europe and working with Dr. Kiril Gashteovski and Dr. Carolin Lawrence.

In Spring 2023, I will also be a Research Intern at WING (Web Information Retrieval & Natural Language Processing Group) at the National University of Singapore, advised by Prof. Min-Yen Kan. I will work on NLP and IR related to job descriptions and related text sources.

My main focus is working on automated high-quality Information Extraction from unstructured text with real-life use cases that have societal impact. In my case, I am working on Skill Extraction for Job Market Analysis. My other interests include tricks and approaches to get more labeled training data and/or exploit models for tasks with limited data — this includes Active Learning, Weak Supervision, and Transfer Learning.


News


Publications

2022

[8] Mike Zhang, Kristian Nørgaard Jensen, Rob van der Goot, and Barbara Plank. 2022. Skill Extraction from Job Postings using Weak Supervision. In Workshop on Recommender Systems for Human Resources 2022 (RecSysHR2022), in conjunction with the 16th ACM Conference on Recommender Systems 2022 (RecSys). [Paper] [Code]

[7] Dennis Ulmer, Elisa Bassignana, Max Müller-Eberstein, Daniel Varab, Mike Zhang, Christian Hardmeier, and Barbara Plank. 2022. Experimental Standards for Deep Learning Research: A Natural Language Processing Perspective. In the Machine Learning Evaluation Standards Workshop at ICLR 2022 (SMILES). Outstanding Paper Award. [Paper] [Repository]

[6] Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, and Barbara Plank. 2022. SkillSpan: Hard and Soft Skill Extraction from Job Postings. In Proceedings of the 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). [Paper] [Code]

[5] Mike Zhang, Kristian Nørgaard Jensen, and Barbara Plank. 2022. Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning. In Proceedings of the 13th Edition of the Language Resources and Evaluation Conference (LREC). [Paper] [Code]

2021

[4] Mike Zhang and Barbara Plank. 2021. Cartography Active Learning. In Findings of the Association for Computational Linguistics: EMNLP 2021. [Paper] [Slides] [Code] [Video]

[3] Kristian Nørgaard Jensen, Mike Zhang and Barbara Plank. 2021. De-identification of Privacy-related Entities in Job Postings. In Proceedings of the 23rd Nordic Conference of Computational Linguistics (NoDaLiDa). [Paper] [Slides] [Code] [Video]

2019

[2] Mike Zhang and Antonio Toral. 2019. The Effect of Translationese in Machine Translation Test Sets. In Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers). [Paper] [Slides] [Code]

[1] Mike Zhang, Roy David, Leon Graumans and Gerben Timmerman. 2019. Grunn2019 at SemEval-2019 Task 5: Shared Task on Multilingual Detection of Hate. In Proceedings of the 13th International Workshop on Semantic Evaluation. [Paper] [Code]


Teaching

IT University of Copenhagen

Spring 2021, 2022

BSSEYEP1KU, Introduction to NLP and Deep Learning (Senior TA, Lecturer)

Spring 2022

Master Thesis, Computer Science. (Supervision)

Fall 2021

Master Research Project, Computer Science. (Supervision)

Fall 2021

PhD Course, Communicating State-of-the-art NLP Research to a Broader Audience (Co-Organizer)


University of Groningen

Fall 2020

SOMINDW07, Machine Learning (Head TA)

Fall 2019

SOMINDW07, Machine Learning (Head TA)

LIX016M05, Learning from Data (Head TA)

Spring 2019

LIX017B05, Social Media (TA)


Resume

Academic Background

Current

PhD Computer Science, IT University of Copenhagen

2020

MA Information Science, University of Groningen

2018

BSc Information Science, University of Groningen


Other (Professional) Experiences

01/2020 - 08/2020

Data Engineer, Dataprovider.com B.V.

09/2019 - 12/2019

Research Engineer Intern, Dataprovider.com B.V.



Services

Chairing: LREC (Co-chair, 2022)

Reviewer: ACL (2019-), EMNLP (2021-), ARR (2021-), CoNLL (2021-), NAACL SRW (2022), W-NUT (2021), RecSysHR (2022)


Contact

You can reach me at mikz(at)itu(dot)dk or message me on any other platform here.