Proyag Pal's CV
If you’d prefer a one-page résumé: see PDF.
- PhD in Informatics — University of Edinburgh — 2020-2023 (expected)
- MSc in Informatics — University of Edinburgh — 2016-2017
- 1-year postgraduate degree in Informatics, specialised in Natural Language Processing.
- Graduated with distinction.
- Courses included Machine Translation, Accelerated Natural Language Processing, Machine Learning and Pattern Recognition, Machine Learning Practical (Deep Learning).
- MSc in Computer Science — St. Xavier’s College, Kolkata — 2014-2016
- 2-year postgraduate degree in Computer Science.
- Graduated with 82% marks.
- Courses included Artificial Intelligence, Data Mining and Warehousing, Image Processing and Pattern Recognition.
- BSc in Computer Science — St. Xavier’s College, Kolkata — 2011-2014
- 3-year undergraduate degree with honours in Computer Science, and Mathematics and Physics as general subjects.
- Graduated with 80% marks.
- TAUS — Data Engineer — Amsterdam, Jun-Oct 2020
- Worked on the ParaCrawl project.
- Optimised, maintained, and ran a highly scalable processing pipeline to extract, translate, align, and clean parallel corpora obtained through web crawling.
- Unbabel — Junior AI Researcher — Lisbon, Feb-Apr 2020
- Machine translation and Quality estimation for customer-facing product teams.
- World Intellectual Property Organization (WIPO) — Fellow in Machine Translation — Geneva, Feb 2018-Jan 2020
- Fellow in Neural Machine Translation at the Advanced Technology Applications Center (ATAC), working on WIPO Translate.
- Built, improved, evaluated and deployed domain-specific neural and statistical machine translation models using the Marian and Moses toolkits.
- Developed neural text classification systems for patent categorisation (IPCCAT).
- Developed a system using BERT/XLM/LASER sentence representations and Faiss indexes to retrieve semantically similar content from large collections of text.
- Assisted in the adoption of Neural MT at IMF, WTO, KIPO, and some other international organisations and IP offices.
- University of Edinburgh — Research Assistant — Edinburgh, Sep-Dec 2017
- Research assistant at the Institute for Language, Cognition and Computation (ILCC).
- Worked on an EPSRC GCRF-funded project in collaboration with the University of Cape Town to build a machine translation system to facilitate communication in the medical domain between isiXhosa-speaking patients and English-speaking doctors in health centres in South Africa.
- Supervised by Dr. Kenneth Heafield and Dr. Alexandra Birch.
Also see: Projects
- Bengali - Mother tongue
- English - Native
- French - Intermediate (B1)
- Hindi - Fluent
- Chinese (Mandarin) - Basic