CV
Education
- PhD in Informatics — University of Edinburgh — 2020-2024 (expected)
- 3-year doctoral programme at the Institute for Language, Cognition and Computation (ILCC), working with the StatMT group.
- Researching multi-encoder models that supply additional context to neural machine translation systems, with the aim of analysing and improving them.
- Supervised by Dr. Kenneth Heafield.
- MSc in Informatics — University of Edinburgh — 2016-2017
- 1-year postgraduate degree in Informatics, specialised in Natural Language Processing.
- Graduated with distinction.
- Courses included Machine Translation, Accelerated Natural Language Processing, Machine Learning and Pattern Recognition, and Machine Learning Practical (Deep Learning).
- MSc in Computer Science — St. Xavier’s College, Kolkata — 2014-2016
- 2-year postgraduate degree in Computer Science.
- Graduated with 82% marks.
- Courses included Artificial Intelligence, Data Mining and Warehousing, and Image Processing and Pattern Recognition.
- BSc in Computer Science — St. Xavier’s College, Kolkata — 2011-2014
- 3-year undergraduate degree with honours in Computer Science, and Mathematics and Physics as general subjects.
- Graduated with 80% marks.
Experience
- University of Zurich — Visiting Researcher — Zurich, Mar 2023-May 2023
- Amazon AWS AI — Applied Scientist Intern — Santa Clara, Nov 2022-Feb 2023
- Worked on isochronous machine translation for automatic dubbing.
- Co-organised the dubbing track at IWSLT 2023.
- TAUS — Data Engineer — Amsterdam, Jun-Oct 2020
- Worked on the ParaCrawl project.
- Optimised, maintained, and ran a highly scalable processing pipeline to extract, translate, align, and clean parallel corpora obtained through web crawling.
- Unbabel — Junior AI Researcher — Lisbon, Feb-Apr 2020
- Worked on machine translation and quality estimation for customer-facing product teams.
- World Intellectual Property Organization (WIPO) — Fellow in Machine Translation — Geneva, Feb 2018-Jan 2020
- Fellow in Neural Machine Translation at the Advanced Technology Applications Center (ATAC), working on WIPO Translate.
- Built, improved, evaluated and deployed domain-specific neural and statistical machine translation models using the Marian and Moses toolkits.
- Developed neural text classification systems for patent categorisation (IPCCAT).
- Developed a system using BERT/XLM/LASER sentence representations and Faiss indexes to retrieve semantically similar content from large collections of text.
- Assisted in the adoption of neural MT at IMF, WTO, KIPO, and other international organisations and IP offices.
- University of Edinburgh — Research Assistant — Edinburgh, Sep-Dec 2017
Technical Skills
- Python
- C++
- Git
- Bash
- Perl
- Docker
Language Skills
- Bengali - Mother tongue
- English - Native
- French - Intermediate (B1)
- Hindi - Fluent
- Chinese (Mandarin) - Basic