I am a research associate at the Language Technology Lab of the University of Cambridge. I currently work on the PheneBank project (PI: Dr. Nigel Collier), which lies at the intersection of Natural Language Processing and the biomedical sciences. PheneBank seeks to develop a new method for identifying and harmonising human phenotypes in the scientific literature, as well as their associations with entities of interest such as diseases, genes and other phenotypes.

My research interests include semantic representation and similarity, ontology alignment and enrichment, and Word Sense Disambiguation. Please see my more detailed (but outdated) Research Statement.


Sapienza University

I did my PhD at the Linguistic Computing Laboratory of the Sapienza University of Rome, Italy. My PhD advisor was Dr. Roberto Navigli, with whom I did research in lexical semantics, focusing on the unified semantic representation of different linguistic items.

University of Tehran

I have an MSc in Artificial Intelligence and Robotics from the Department of Electrical and Computer Engineering (ECE) of the University of Tehran, Iran. For my Master's thesis, I constructed the Tehran English-Persian Parallel corpus (TEP) and carried out research on automatic translation between English and Persian.



M. Gritta, M. T. Pilehvar, and N. Collier
Which Melbourne? Augmenting Geocoding with Maps.
ACL 2018, Melbourne, Australia.

J. Camacho-Collados and M. T. Pilehvar
From Word to Sense Embeddings: A Survey on Vector Representations of Meaning.
In submission, 2018.

Milan Gritta, Mohammad Taher Pilehvar, Nut Limsopatham, and Nigel Collier
What’s missing in geographical parsing?
Language Resources and Evaluation, pp 1-21, Springer, 2018.

M. T. Pilehvar, J. Camacho-Collados, R. Navigli, and N. Collier.
Towards a Seamless Integration of Word Senses into Downstream NLP Applications.
ACL 2017, Vancouver, Canada.

M. Gritta, M. T. Pilehvar, N. Limsopatham, and N. Collier
Vancouver Welcomes You! Minimalist Location Metonymy Resolution.
ACL 2017, Vancouver, Canada.
Selected as an ACL 2017 Outstanding Paper

J. Camacho-Collados, M. T. Pilehvar, N. Collier, and R. Navigli.
SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity.
SemEval 2017, Vancouver, Canada.

M. T. Pilehvar and N. Collier
Inducing Embeddings for Rare and Unseen Words by Leveraging Lexical Resources.
EACL 2017, pp 388-393, Valencia, Spain.

J. Camacho-Collados, M. T. Pilehvar, and R. Navigli.
Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities.
Artificial Intelligence, 240, pp 36-64, Elsevier, 2016.

M. T. Pilehvar and N. Collier
De-Conflated Semantic Representations.
EMNLP 2016, pp 1680-1690, Austin, TX.

I. Iacobacci, M. T. Pilehvar, and R. Navigli
Embeddings for Word Sense Disambiguation: An Evaluation Study.
ACL 2016, Berlin, Germany.

M. T. Pilehvar and N. Collier
Improved Semantic Representation for Domain-Specific Entities.
BioNLP, ACL 2016, Berlin, Germany.

    D. Jurgens and M. T. Pilehvar
    SemEval-2016 Task 14: Semantic Taxonomy Enrichment.
    SemEval 2016, San Diego, California.

    D. Jurgens, M. T. Pilehvar, and R. Navigli.
    Cross Level Semantic Similarity: An Evaluation Framework for Universal Measures of Similarity.
    Language Resources and Evaluation, 50(1), pp. 5-33, Springer, 2016.

    M. T. Pilehvar and R. Navigli.
    From Senses to Texts: An All-in-one Graph-based Approach for Measuring Semantic Similarity.
    Artificial Intelligence, 228, pp. 95-128, Elsevier, 2015.

    I. Iacobacci, M. T. Pilehvar, and R. Navigli.
    SensEmbed: Enhancing Word Embeddings for Semantic Similarity and Relatedness.
    ACL 2015, Beijing, China.

    J. Camacho-Collados, M. T. Pilehvar, and R. Navigli.
    A Framework for the Construction of Monolingual and Cross-lingual Semantic Similarity Datasets.
    ACL 2015 (short), Beijing, China.

    J. Camacho-Collados, M. T. Pilehvar, and R. Navigli.
    A Unified Multilingual Semantic Representation of Concepts.
    ACL 2015, Beijing, China.

    M. T. Pilehvar and R. Navigli.
    An Open-source Framework for Multi-level Semantic Similarity Measurement.
    NAACL 2015 (demo), Denver, USA.

    D. Jurgens and M. T. Pilehvar.
    Reserating the awesometastic: An automatic extension of the WordNet taxonomy for novel terms.
    NAACL 2015 (short), Denver, USA.

    J. Camacho-Collados, M. T. Pilehvar, and R. Navigli.
    NASARI: a Novel Approach to a Semantically-Aware Representation of Items.
    NAACL 2015, Denver, USA.

    M. T. Pilehvar and R. Navigli.
    A Large-scale Pseudoword-based Evaluation Framework for State-of-the-Art Word Sense Disambiguation.
    Computational Linguistics, 40(4), pp. 837-881, MIT Press, 2014.

    D. Jurgens, M. T. Pilehvar, and R. Navigli.
    SemEval-2014 Task 3: Cross-Level Semantic Similarity.
    SemEval 2014, Dublin, Ireland, 2014.

    M. T. Pilehvar and R. Navigli.
    A Robust Approach to Aligning Heterogeneous Lexical Resources.
    ACL 2014, Baltimore, USA, 2014.

    M. T. Pilehvar, D. Jurgens and R. Navigli.
    Align, Disambiguate and Walk: A Unified Approach for Measuring Semantic Similarity.
    ACL 2013, Sofia, Bulgaria, 2013.
    Nominated for ACL 2013 best paper award.

    M. T. Pilehvar and R. Navigli.
    Paving the Way to a Large-scale Pseudosense-annotated Dataset.
    NAACL 2013, Atlanta, USA, 2013.

    M. T. Pilevar, A. H. Pilevar and Heshaam Faili.
    TEP: Tehran English-Persian parallel corpus.
    CICLING 2011, Tokyo, Japan, 2011.


I enjoy stargazing, lying flat on my back under the clear skies of my hometown (Hamedan). Many years ago, I used to do some astrophotography and send my photos to Spaceweather.com. I think this is my last photo published on Spaceweather. I once had a photo published in Astronomy Magazine (the largest U.S. magazine on the subject).


  • mp792@cam.ac.uk

Department of Theoretical and Applied Linguistics
University of Cambridge
Sidgwick Avenue
United Kingdom