I am an assistant professor at the Computer Engineering department of Iran University of Science and Technology and an affiliated lecturer at the University of Cambridge.

My research interests include semantic representation and similarity, ontology alignment and enrichment, and Word Sense Disambiguation (see my more detailed, but outdated, research statement).


University of Cambridge

I was a research associate at the Language Technology Lab of the University of Cambridge for three years. During this time, I worked on PheneBank (PI: Dr. Nigel Collier), a project at the intersection of Natural Language Processing and biomedical sciences, funded by the UK's Medical Research Council.

Sapienza University

I did my PhD at the Linguistic Computing Laboratory of the Sapienza University of Rome, Italy. My PhD advisor was Dr. Roberto Navigli, with whom I did research in lexical semantics, focusing on the unified semantic representation of different linguistic items.

University of Tehran

I have an MSc in Artificial Intelligence and Robotics from the Department of Electrical and Computer Engineering (ECE) of the University of Tehran, Iran. For my Master's thesis, I constructed the Tehran English-Persian Parallel corpus (TEP) and carried out research on automatic translation between English and Persian.


  • Apr 2019: A new version of the WiC dataset has been released.
  • Apr 2019: Co-organizing a shared task on sense distinction in the SemDeep workshop at IJCAI 2019.
  • Mar 2019: Three papers accepted at NAACL 2019. Cannot attend the conference due to the travel ban, though!
  • Feb 2019: Happy to have been selected as the best professor in the AI and Software groups according to the end-of-term student evaluation (department of Computer Engineering, IUST).
  • Jan 2019: Teaching three courses this semester: Artificial Intelligence, Advanced Programming (in Java), and Deep Learning (graduate). More info here.
  • Jan 2019: A beta version of the PheneBank demo is online; feedback is welcome!
  • Dec 2018: Serving as an area chair for ACL 2019.
  • Nov 2018: Paper on unseen word representation accepted to AAAI 2019. Congrats to Victor, my PhD student.
  • Sep 2018: Code and resources for Card-660 and MS-LSTM (EMNLP 2018).
  • Sep 2018: Check out our challenging benchmark for context-sensitive or sense embeddings: WiC (the Word-in-Context dataset).
  • Aug 2018: Our survey on sense representation was accepted for publication in the Journal of Artificial Intelligence Research (JAIR).
  • Aug 2018: 3 long papers accepted at EMNLP 2018! Also, 2 papers at BlackboxNLP'18 and RDSM'18 workshops.
  • ► I was offered lectureship positions by some of the best universities in the UK, including Exeter and Southampton (second-best ECS department in the UK), and turned them all down! Going back home!!


V. Prokhorov, M. T. Pilehvar, D. Kartsaklis, P. Lio, and N. Collier
Unseen Word Representation by Aligning Heterogeneous Lexical Semantic Spaces.
AAAI 2019, Hawaii, USA.

M. T. Pilehvar and J. Camacho-Collados
WiC: the Word-in-Context Dataset for Evaluating Context-Sensitive Meaning Representations.
NAACL 2019, Minneapolis, USA.

M. T. Pilehvar
On the Importance of Distinguishing Word Meaning Representations: A Case Study on Reverse Dictionary Mapping.
NAACL 2019, Minneapolis, USA.

V. Prokhorov, M. T. Pilehvar, and N. Collier
Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models.
NAACL 2019, Minneapolis, USA.

M. T. Pilehvar, D. Kartsaklis, V. Prokhorov and N. Collier
Card-660: Cambridge Rare Word Dataset - a Reliable Benchmark for Infrequent Word Representation Models.
EMNLP 2018, Brussels, Belgium.

D. Kartsaklis, M. T. Pilehvar and N. Collier
Mapping Text to Knowledge Graph Entities using Multi-Sense LSTMs.
EMNLP 2018, Brussels, Belgium.

H. Le, D. Can, S. T. Vu, T. H. Dang, M. T. Pilehvar and N. Collier
Large-scale Exploration of Neural Relation Classification Architectures.
EMNLP 2018, Brussels, Belgium.

M. Gritta, M. T. Pilehvar, and N. Collier
Which Melbourne? Augmenting Geocoding with Maps.
ACL 2018, Melbourne, Australia.

J. Camacho-Collados and M. T. Pilehvar
From Word to Sense Embeddings: A Survey on Vector Representations of Meaning.
Journal of Artificial Intelligence Research, 2018.

M. Gritta, M. T. Pilehvar, N. Limsopatham, and N. Collier
What’s missing in geographical parsing?
Language Resources and Evaluation, pp 1-21, Springer, 2018.


I enjoy stargazing, lying flat on my back under the clear skies of my hometown (Hamedan). Many years ago, I used to do some astrophotography and send my photos to Spaceweather.com. I think this is my last photo published on Spaceweather. I also once had a photo published in Astronomy Magazine (the largest U.S. magazine on the subject).


  • mp792@cam.ac.uk

Department of Theoretical and Applied Linguistics
University of Cambridge
Sidgwick Avenue
United Kingdom