Mr Tathagata Bhattacharjee

ALPHA Network Data Analyst

United Kingdom

I joined the school in 2017 as a Data Analyst for ALPHA Network. I hold a BSc Honours degree in Physics, complemented by a Master of Business Administration (MBA) degree with a specialization in Information Technology. I am an enthusiastic staff research degree student pursuing my PhD study in parallel with my professional commitments at the institution.

Early in my career, while consulting for the Vadu Health and Demographic Surveillance System, I played a pivotal role in introducing electronic field data capture systems. Initially, we utilized laptop computers and later transitioned to Android tablets. This initiative marked one of the pioneering deployments of large-scale longitudinal field surveillance using tablets in India. I also had the privilege of serving as a member of the Scientific Advisory Committee for the INDEPTH Network. Subsequently, I assumed a critical role in research data management in INDEPTH's iSHARE2 project, dedicated to data harmonization and sharing. Furthermore, I led the technical team responsible for the National Surveillance System for Enteric Fever in India, a testament to my commitment to advancing public health. These valuable experiences laid a strong foundation for my journey before joining the school.

In my current professional capacity, I serve as a Data Analyst/Scientist for two prominent networks: the ALPHA Network ( and the INSPIRE Network ( & ). My primary focus revolves around the intricate world of data science.

Within the ALPHA Network, I have made substantial contributions by designing and implementing an ETL (Extract, Transform, Load) pipeline using the powerful Pentaho Data Integration tool. This pipeline efficiently manages ALPHA site data within the innovative Centre-in-a-Box (CiB) environment, consistently meeting precise data specifications. Additionally, I have spearheaded the development of process automation pipelines within the ALPHA server, spanning from data uploads to meticulous data quality checks, comprehensive quality reports, and data harmonization.

In the context of the INSPIRE Network, I've harnessed my expertise in OHDSI tools to seamlessly migrate data from ALPHA data specifications to the OMOP CDM (Observational Medical Outcomes Partnership Common Data Model). I am deeply involved in the critical task of harmonizing COVID-19 data sourced from the Integrated Disease Surveillance and Response in the African Region into the OMOP CDM. I've adeptly configured the INSPIRE platform-as-a-service (PaaS) on Microsoft Azure cloud services to streamline and accelerate these complex data processing tasks. Generated a synthetic dataset for WHO Integrated Disease Surveillance and Response (IDSR) for the Africa Region, focusing on COVID-19. This dataset has been used for the development of the ETL pipeline to migrate data from IDSR format to the standardised Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). The details of this work are made available on a GitHub repository (…). Work on mental health, other infectious and non-communicable diseases are in progress on the INSPIRE platform.

In summary, my academic foundation and experience have provided me with a strong knowledge base, which I apply to drive impactful advancements in data science and processing within the population health and research domains.


Department of Population Health
Faculty of Epidemiology and Population Health


I have been instructing ETL techniques as part of the Health Data Management Module for the MSc Health Data Science program from the academic year 2020-2021 through 2022-2023.


My research centers on the application of Machine Learning techniques to facilitate robust data record linkage between Health and Demographic Surveillance Systems (HDSS) and HIV clinic datasets, all within the same geographical regions. This ambitious endeavour aims to address critical gaps in healthcare and epidemiological research.

The datasets under scrutiny are sourced from an HDSS site in Tanzania. These datasets represent a rich and diverse repository of healthcare information. When effectively interconnected, they hold the potential to uncover profound insights into the ever-evolving dynamics of public health, especially within the context of HIV trends.

I am fortunate to have the guidance of mentors throughout this intellectual journey. Professor Jim Todd and Dr. Emma Slaymaker from the school alongside Dr. Chodziwadziwa Kabudula from the University of the Witwatersrand, South Africa, have been providing me with invaluable support from the Department of Population Health, Faculty of Epidemiology and Population Health. Their expertise and unwavering commitment plays a pivotal role in shaping the direction and impact of my research.

Selected Publications

INSPIRE datahub: a pan-African integrated suite of services for harmonising longitudinal population health data using OHDSI tools.
BHATTACHARJEE, T; Kiwuwa-Muyingo, S; Kanjala, C; Maoyi, ML; Amadi, D; Ochola, M; Kadengye, D; Gregory, A; Kiragga, A; Taylor, A; Greenfield, J; SLAYMAKER, E; TODD, J; INSPIRE Network,;
Frontiers in Digital Health
D7.2 Population health resource library
and training package
TODD, J; TOMLIN, K; BHATTACHARJEE, T; AMADI, D; Greenfield, J; Fils, D; Mailosi, D; Kanjala, C;
Enabling data sharing and utilization for African population health data using OHDSI tools with an OMOP-common data model.
Kiwuwa-Muyingo, S; TODD, J; BHATTACHARJEE, T; Taylor, A; Greenfield, J;
Frontiers in public health
INSPIRE EA IT Infrastructure on the Cloud
CODATA - Committee on Data of the International Science Council
ALPHA to OMOP Data and Vocabulary Mapping
BHATTACHARJEE, T; Greenfield, J;
CODATA - Committee on Data of the International Science Council
Validating a GPS-based approach to detect health facility visits against maternal response to prompted recall survey
Marsh, A; Hirve, S; Lele, P; Chavan, U; BHATTACHARJEE, T; Nair, H; Campbell, H; Juvekar, S;
Journal of Global Health
Determinants and patterns of care-seeking for childhood illness in rural Pune District, India
Marsh, A; Hirve, S; Lele, P; Chavan, U; BHATTACHARJEE, T; Nair, H; Juvekar, S; Campbell, H;
Journal of Global Health
Demographic surveillance over 12 years helps elicit determinants of low birth weights in India
Apte, A; Patil, R; Lele, P; Choudhari, B; BHATTACHARJEE, T; Bavdekar, A; Juvekar, S;
PLoS One
Ethical considerations in the use of GPS-based movement tracking in health research - lessons from a care-seeking study in rural west India
Apte, A; Ingole, V; Lele, P; Marsh, A; BHATTACHARJEE, T; Hirve, S; Campbell, H; Nair, H; Chan, S; Juvekar, S;
Journal of Global Health
Profile: Vadu Health and Demographic Surveillance System Pune, India
Patil, R; Roy, S; Ingole, V; BHATTACHARJEE, T; Chaudhary, B; Lele, P; Hirve, S; Juvekar, S; VaduHDSS team,;
Journal of Global Health
See more information