Research Output

Publications

Full publication record for Vajira Lasantha Thambawita from Google Scholar, covering medical imaging, cardiac AI, reproductive medicine, sports analytics, and generative AI.

Scholar Metrics

Citation Statistics

Statistics sync weekly via GitHub Actions. Last updated: 2025-03-25.

Total Citations
h-index
i10-index
96 Total Papers in this database

Impact Areas

Research Highlights

Large-Scale Datasets

Created and co-created multiple widely used benchmark datasets for gastrointestinal endoscopy, sperm analysis, and cardiac imaging.

Generative AI for Medicine

Pioneered the use of GANs and diffusion models to generate synthetic medical data, addressing data scarcity in healthcare AI.

Cardiac AI

Developed DeepFake ECG technology using GANs to address privacy in cardiac data sharing, with broad impact on medical privacy research.

Reproductive Medicine AI

Advanced AI methods for sperm analysis and fertility prediction, including tracking, morphology assessment, and ICSI procedure automation.

Most Cited

Top Cited Publications

Ordered by citation count, synced weekly from Google Scholar.

  1. HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy

    Hanna Borgli, Vajira Thambawita, Pia H Smedsrud, Steven Hicks, Debesh Jha, et al.

    Scientific Data 2020
  2. DeepFake electrocardiograms using generative adversarial networks are the beginning of the end for privacy issues in medicine

    Vajira Thambawita, Jonas L Isaksen, Steven A Hicks, Jonas Ghouse, Gustav Ahlberg, et al.

    Scientific Reports 2021
  3. Kvasir-Capsule, a video capsule endoscopy dataset

    Pia H Smedsrud, Vajira Thambawita, Steven A Hicks, Henrik Gjestang, Oda Olsen Nedrejord, et al.

    Scientific Data 2021
  4. SinGAN-Seg: Synthetic training data generation for medical image segmentation

    Vajira Thambawita, Pegah Salehi, Sajad Amouei Sheshkal, Steven A Hicks, et al.

    PLoS ONE 2022
  5. An extensive study on cross-dataset bias and evaluation metrics interpretation for machine learning applied to gastrointestinal tract abnormality classification

    Vajira Thambawita, Debesh Jha, Hugo Lewi Hammer, Håvard D Johansen, Dag Johansen, Pål Halvorsen, Michael A Riegler

    ACM Transactions on Computing for Healthcare 2020
  6. Machine learning-based analysis of sperm videos and participant data for male fertility prediction

    Steven A Hicks, Jorunn M Andersen, Oliwia Witczak, Vajira Thambawita, Pål Halvorsen, et al.

    Scientific Reports 2019
  7. On evaluation metrics for medical applications of artificial intelligence

    Steven A Hicks, Inga Strümke, Vajira Thambawita, Malek Hammou, Michael A Riegler, Pål Halvorsen, Sravanthi Parasa

    Scientific Reports 2022
  8. Impact of image resolution on deep learning performance in endoscopy image classification

    Vajira Thambawita, Inga Strümke, Steven A Hicks, Pål Halvorsen, Sravanthi Parasa, Michael A Riegler

    Diagnostics 2021
  9. VISEM-Tracking, a human spermatozoa tracking dataset

    Vajira Thambawita, Steven A Hicks, Andrea M Storås, Thu Nguyen, Jorunn M Andersen, et al.

    Scientific Data 2023
  10. Explaining deep neural networks for knowledge discovery in electrocardiogram analysis

    Steven A Hicks, Jonas L Isaksen, Vajira Thambawita, Jonas Ghouse, Gustav Ahlberg, et al.

    Scientific Reports 2021

Complete Record

All Publications

96 papers grouped by year. Includes journal articles, conference papers, datasets, and preprints.

2024

5 publications
  1. Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge

    Scientific Reports Nature Publishing Group UK London
  2. Advancing sleep detection by modelling weak label sets: A novel weakly supervised learning approach

    arXiv preprint arXiv:2402.17601
  3. Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024

    European Conference on Information Retrieval Springer Nature Switzerland Cham
  4. SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset

    arXiv preprint arXiv:2405.07354
  5. MediaEval 2023: Multimedia Benchmark Workshop 2023, Working Notes Proceedings of the MediaEval 2023 Workshop Amsterdam, The Netherlands and Online, 1-2 February 2024

    S.l.: CEUR

2023

14 publications
  1. VISEM-Tracking, a human spermatozoa tracking dataset

    Scientific Data Nature Publishing Group UK London
  2. GridHTM: Grid-Based Hierarchical Temporal Memory for Anomaly Detection in Videos

    Sensors MDPI
  3. ScopeSense: An 8.5-month sport, nutrition, and lifestyle lifelogging dataset

    International Conference on Multimedia Modeling Springer International Publishing Cham
  4. ImageCLEF 2023 Highlight: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications

    European Conference on Information Retrieval Springer Nature Switzerland Cham
  5. Multimedia datasets: challenges and future possibilities

    International Conference on Multimedia Modeling Springer Nature Switzerland Cham
  6. Mask-conditioned latent diffusion for generating gastrointestinal polyp images

    Proceedings of the 4th ACM Workshop on Intelligent Cross-Data Analysis and Retrieval
  7. Usefulness of Heat Map Explanations for Deep-Learning-Based Electrocardiogram Analysis

    Diagnostics MDPI
  8. RePolyp: A Framework for Generating Realistic Colon Polyps with Corresponding Segmentation Masks using Diffusion Models

    2023 IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS) IEEE
  9. An objective validation of polyp and instrument segmentation methods in colonoscopy through Medico 2020 polyp segmentation and MedAI 2021 transparency challenges

    arXiv preprint arXiv:2307.16262
  10. Overview of ImageCLEFmedical 2023 – medical visual question answering for gastrointestinal tract

    CLEF2023 Working Notes, CEUR Workshop Proceedings, CEUR-WS.org, Thessaloniki, Greece
  11. Overview of ImageCLEF 2023: Multimedia retrieval in medical, social media and recommender systems applications

    Experimental IR Meets Multilinguality, Multimodality, and Interaction, Proceedings of the 14th International Conference of the CLEF Association (CLEF 2023), Springer Lecture Notes in Computer Science LNCS, Thessaloniki, Greece
  12. Cellular, a cell autophagy imaging dataset

    Scientific Data Nature Publishing Group UK London
  13. Working Notes Proceedings of the MediaEval 2022 Workshop

    CEUR Workshop Proceedings 3583
  14. An Open-Access Dataset of Hospitalized Cardiac-Arrest Patients: Machine-Learning-Based Predictions Using Clinical Documentation

    BioMedInformatics MDPI

2022

25 publications
  1. On evaluation metrics for medical applications of artificial intelligence

    Scientific Reports Nature Publishing Group UK London
  2. SinGAN-Seg: Synthetic training data generation for medical image segmentation

    PLoS ONE Public Library of Science San Francisco, CA USA
  3. Meta-learning with implicit gradients in a few-shot setting for medical image segmentation

    Computers in Biology and Medicine Pergamon
  4. MMSys'22 Grand Challenge on AI-based Video Production for Soccer

    arXiv preprint arXiv:2202.01031
  5. #ESHREjc report: on the road to preconception and personalized counselling with machine learning models

    Human Reproduction Oxford University Press
  6. PolypConnect: Image inpainting for generating realistic gastrointestinal tract images with polyps

    2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS) IEEE
  7. Segmentation Consistency Training: Out-of-Distribution Generalization for Medical Image Segmentation

    2022 IEEE International Symposium on Multimedia (ISM) IEEE
  8. Grid HTM: Hierarchical Temporal Memory for Anomaly Detection in Videos

    arXiv preprint arXiv:2205.15407
  9. Synthesizing a talking child avatar to train interviewers working with maltreated children

    Big Data and Cognitive Computing MDPI
  10. Chapter 4 Smittestopp analytics: Analysis of position data

    Smittestopp - A Case Study on Digital Contact Tracing Springer International Publishing Cham
  11. P-108 Real-time deep learning based multi object tracking of spermatozoa in fresh samples

    Human Reproduction Oxford University Press
  12. P-272 Automatic Tracking of the ICSI procedure using Deep Learning

    Human Reproduction Oxford University Press
  13. P-243 Automating tracking of cell division for human embryo development in time lapse videos

    Human Reproduction Oxford University Press
  14. Njord: a fishing trawler dataset

    Proceedings of the 13th ACM Multimedia Systems Conference
  15. Reproducibility Companion Paper: Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies

    Proceedings of the 30th ACM International Conference on Multimedia
  16. Automating tracking of cell division for human embryo development in time lapse videos

    Human Reproduction Oxford University Press
  17. Real-time deep learning based multi object tracking of spermatozoa in fresh samples

    Human Reproduction Oxford University Press
  18. Automatic Tracking of the ICSI procedure using Deep Learning

    Human Reproduction Oxford University Press
  19. MLC at HECKTOR 2022: The Effect and Importance of Training Data When Analyzing Cases of Head and Neck Tumors Using Machine Learning

    3D Head and Neck Tumor Segmentation in PET/CT Challenge Springer Nature Switzerland Cham
  20. Poster Session 1: Baseline filtering alleviates generalization issues for neural networks for electrocardiogram analysis

    Journal of Electrocardiology Churchill Livingstone
  21. Biomedical image analysis competitions: The state of current participation practice

    arXiv preprint arXiv:2212.08568
  22. Medico Multimedia Task at MediaEval 2022: Transparent tracking of spermatozoa

    Proceedings of MediaEval 2022 CEUR Workshop
  23. Automatic Unsupervised Clustering of Videos of the Intracytoplasmic Sperm Injection (ICSI) Procedure

    Symposium of the Norwegian AI Society Springer International Publishing Cham
  24. On evaluation metrics for medical applications of artificial intelligence.

    Articles, Abstracts, and Reports
  25. Overview of the ImageCLEF 2022: Multimedia retrieval in medical, social media and nature applications

    International Conference of the Cross-Language Evaluation Forum for European Languages Springer International Publishing Cham

2021

29 publications
  1. Kvasir-Capsule, a video capsule endoscopy dataset

    Scientific Data Nature Publishing Group UK London
  2. HTAD: A home-tasks activities dataset with wrist-accelerometer and audio features

    MultiMedia Modeling: 27th International Conference, MMM 2021, Prague, Czech Republic, June 22–24, 2021, Proceedings, Part II 27 Springer International Publishing
  3. Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy

    MultiMedia Modeling: 27th International Conference, MMM 2021, Prague, Czech Republic, June 22–24, 2021, Proceedings, Part II 27 Springer International Publishing
  4. Explaining deep neural networks for knowledge discovery in electrocardiogram analysis

    Scientific Reports Nature Publishing Group UK London
  5. The EndoTect 2020 challenge: evaluation and comparison of classification, segmentation and inference time for endoscopy

    Pattern Recognition. ICPR International Workshops and Challenges: Virtual Event, January 10-15, 2021, Proceedings, Part VIII Springer International Publishing
  6. A comprehensive analysis of classification methods in gastrointestinal endoscopy imaging

    Medical Image Analysis Elsevier
  7. Fr615 impact of image resolution on convolutional neural networks performance in gastrointestinal endoscopy

    Gastroenterology WB Saunders
  8. ID: 3523524 Data augmentation using generative adversarial networks for creating realistic artificial colon polyp images: validation study by endoscopists

    Gastrointestinal Endoscopy Mosby
  9. Few-shot segmentation of medical images based on meta-learning with implicit gradients

    arXiv preprint arXiv:2106.03223
  10. DeepSynthBody: the beginning of the end for data deficiency in medicine

    2021 International Conference on Applied Artificial Intelligence (ICAPAI) IEEE
  11. DivergentNets: Medical image segmentation by network ensemble

    arXiv preprint arXiv:2107.00283
  12. Using 3D convolutional neural networks for real-time detection of soccer events

    International Journal of Semantic Computing World Scientific Publishing Company
  13. A self-learning teacher-student framework for gastrointestinal image classification

    2021 IEEE 34th International Symposium on Computer-Based Medical Systems (CBMS) IEEE
  14. P-029 Identification of spermatozoa by unsupervised learning from video data

    Human Reproduction Oxford University Press
  15. Multimodal virtual avatars for investigative interviews with children

    Proceedings of the 2021 ACM Workshop on Intelligent Cross-Data Analysis and Retrieval
  16. Pyramidal segmentation of medical images using adversarial training

    Proceedings of the 2021 ACM Workshop on Intelligent Cross-Data Analysis and Retrieval
  17. Identification of spermatozoa by unsupervised learning from video data

    Human Reproduction Oxford University Press
  18. Reproducibility companion paper: Norm-in-norm loss with faster convergence and better performance for image quality assessment

    Proceedings of the 29th ACM International Conference on Multimedia
  19. MedAI: Transparency in medical image segmentation

    Nordic Machine Intelligence
  20. Artificial Intelligence in Medicine: Gastroenterology

    Artificial Intelligence in Medicine Springer International Publishing Cham
  21. DeepFake electrocardiograms using generative adversarial networks are the beginning of the end for privacy issues in medicine

    Scientific Reports Nature Publishing Group UK London
  22. Impact of image resolution on deep learning performance in endoscopy image classification: An experimental study using a large dataset of endoscopic images

    Diagnostics MDPI
  23. AI-based video clipping of soccer events

    Machine Learning and Knowledge Extraction MDPI
  24. Automated event detection and classification in soccer: The potential of using multiple modalities

    Machine Learning and Knowledge Extraction MDPI
  25. Automated clipping of soccer events using machine learning

    2021 IEEE International Symposium on Multimedia (ISM) IEEE
  26. Emotional Mario Task at MediaEval 2021.

    MediaEval
  27. Medico Multimedia Task at MediaEval 2021: Transparency in Medical Image Segmentation.

    MediaEval
  28. Automated Event Detection and Classification in Soccer: The Potential of Using Multiple Modalities

    Machine Learning and Knowledge Extraction
  29. Impact of image resolution on convolutional neural networks performance in gastrointestinal endoscopy

    Gastroenterology WB SAUNDERS CO-ELSEVIER INC 1600 JOHN F KENNEDY BOULEVARD, STE 1800 …

2020

9 publications
  1. HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy

    Scientific Data Nature Publishing Group UK London
  2. PMData: a sports logging dataset

    Proceedings of the 11th ACM Multimedia Systems Conference
  3. Psykose: A motor activity database of patients with schizophrenia

    2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS) IEEE
  4. Toadstool: A dataset for training emotional intelligent machines playing Super Mario Bros

    Proceedings of the 11th ACM Multimedia Systems Conference
  5. An extensive study on cross-dataset bias and evaluation metrics interpretation for machine learning applied to gastrointestinal tract abnormality classification

    ACM Transactions on Computing for Healthcare ACM New York, NY, USA
  6. ACM Multimedia BioMedia 2020 grand challenge overview

    Proceedings of the 28th ACM International Conference on Multimedia
  7. Pyramid-focus-augmentation: medical image segmentation with step-wise focus

    arXiv preprint arXiv:2012.07430
  8. Real-time detection of events in soccer videos using 3D convolutional neural networks

    2020 IEEE International Symposium on Multimedia (ISM) IEEE
  9. Vid2Pix - A Framework for Generating High-Quality Synthetic Videos

    2020 IEEE International Symposium on Multimedia (ISM) IEEE

2019

5 publications
  1. Unsupervised preprocessing to improve generalisation for medical image classification

    2019 13th International Symposium on Medical Information and Communication Technology (ISMICT) IEEE
  2. GANEx: A complete pipeline of training, inference and benchmarking GAN experiments

    2019 International Conference on Content-Based Multimedia Indexing (CBMI) IEEE
  3. Machine learning-based analysis of sperm videos and participant data for male fertility prediction

    Scientific Reports Nature Publishing Group UK London
  4. Extracting temporal features into a spatial domain using autoencoders for sperm video analysis

    arXiv preprint arXiv:1911.03100
  5. Stacked dense optical flows and dropout layers to predict sperm motility and morphology

    arXiv preprint arXiv:1911.03086

2018

3 publications
  1. The medico-task 2018: Disease detection in the gastrointestinal tract using global features and deep learning

    arXiv preprint arXiv:1810.13278
  2. Using preprocessing as a tool in medical image detection

  3. CEUR WORKSHOP PROCEEDINGS

2016

2 publications
  1. An optimized Parallel Failure-less Aho-Corasick algorithm for DNA sequence matching

    2016 IEEE International Conference on Information and Automation for Sustainability (ICIAfS) IEEE
  2. To use or not to use: CPUs' cache optimization techniques on GPGPUs

    2016 IEEE International Conference on Information and Automation for Sustainability (ICIAfS) IEEE

2014

1 publication
  1. To use or not to use: Graphics processing units (GPUs) for pattern matching algorithms

    7th International Conference on Information and Automation for Sustainability IEEE

2013

1 publication
  1. Graphics Processing Units: To Use or Not to Use?

    The University of Peradeniya

2011

2 publications
  1. Low Cost Telepresence Robot

    University of Peradeniya
  2. Building a Low-Cost Telepresence Robot with Generic Hardware