Is It Possible to Identify TB Only By A Patient’s Cough?

Is It Possible to Identify TB Only By A Patient’s Cough?
https://www.tbonline.info/media/cache/b9/0f/b90f70edd61a0e156536c5ca02465d35.jpg

Recently I’ve been working on finding sources for my HPQ about AI assisted diagnostics methods for TB (tuberculosis), when I stumbled upon a research article that proposed the usage of AI to analyse a patient’s cough and identify differences and signs that could lead to or assist a conclusive diagnosis. You can read the paper at https://arxiv.org/abs/2310.17675.

Summary of the Article

The article initially discusses the limitations of current TB diagnostic methods, such as sputum culture and GeneXpert. While sputum culture remains the gold standard for TB diagnosis due to its high accuracy, it is a slow process, often taking weeks to yield results. GeneXpert, a cartridge-based nucleic acid amplification test, provides faster results and is more sensitive than traditional smear microscopy, but it still has limitations. It requires specialised equipment, trained personnel, and a stable power supply, making it less accessible in resource-limited settings, particularly in rural areas. Additionally, the cost of GeneXpert cartridges can be prohibitive, especially in developing regions where TB prevalence is high. These challenges create a significant gap in TB detection, leading to delays in diagnosis and treatment, which in turn increases the risk of transmission.

To address these issues, the article proposes the development of an AI-powered smartphone application that can analyse cough sounds to detect TB. This innovative approach leverages machine learning algorithms to identify acoustic patterns associated with the disease, providing a rapid, cost-effective, and widely accessible alternative to conventional diagnostic methods. The idea is that patients would simply need to download the app, record a short sample of their cough, and receive an immediate screening result—all without requiring specialised medical equipment or trained professionals. This could be a game-changer for TB control efforts, especially in remote areas where access to healthcare facilities is limited.

The researchers behind this project developed an ensemble machine learning model combining 2D-CNN (Convolutional Neural Networks) and XGBoost (Extreme Gradient Boosting) to achieve robust and accurate TB detection. CNNs are commonly used for image and audio recognition tasks, making them well-suited for analysing spectrogram representations of cough sounds, while XGBoost is known for its high predictive performance in structured data classification tasks. By combining these two approaches, the model is designed to leverage the strengths of both deep learning and traditional machine learning techniques, ensuring a balance between interpretability and high accuracy.

One of the most impressive aspects of the study is the scale and diversity of the dataset used for training the model. The researchers compiled over 700,000 cough samples from patients across seven different countries, ensuring that the model is trained on a wide variety of voices, cough intensities, and environmental conditions. This large and diverse dataset helps mitigate biases that could arise from using data from a single population, increasing the model’s generalisability and effectiveness in real-world applications.

To enhance the model’s robustness, the researchers employed Mel-spectrograms for feature extraction. Mel-spectrograms convert raw audio signals into visual representations, making it easier for CNNs to identify patterns that might indicate TB-related respiratory characteristics. Additionally, they implemented data augmentation techniques to make the model more resilient to variations such as background noise, microphone quality differences, and variations in cough intensity. This is a crucial step in ensuring that the model performs well outside controlled environments, where patients might be recording their coughs in noisy or unpredictable settings.

The final model achieved an AUROC (Area Under the Receiver Operating Characteristic Curve) of 88%, which is a strong result for a screening tool. AUROC is a key metric used to evaluate diagnostic models, measuring their ability to distinguish between TB-positive and TB-negative cases. An AUROC of 88% exceeds the World Health Organization’s (WHO) minimum requirements for TB screening tests, suggesting that this AI-assisted approach could be a viable complementary tool for early detection and mass screening efforts.

Advantages of Analysing the Patient’s Cough

The most obvious advantage of this diagnostic method is its accessibility. With the ability to easily install an app and self-administer a test for TB, individuals can take control of their health without the need to visit a healthcare facility. This is particularly beneficial in remote or underserved areas where access to medical services may be limited. Furthermore, the elimination or significant reduction in the need for expensive medical equipment makes this method a highly cost-effective option. The reliance on commonly available technology such as smartphones means that more people can be reached, potentially increasing early detection rates and allowing for timely intervention.

Another key benefit is the speed of the diagnostic process. With results available in as little as 15 seconds, patients can quickly learn about their TB status. This rapid turnaround is crucial in managing and controlling the spread of tuberculosis, as it allows for prompt isolation and treatment of positive cases, thereby reducing the risk of transmission. The quick result delivery also minimises the anxiety and uncertainty that patients often experience while waiting for test outcomes.

Additionally, this test is very non-invasive when compared to others. Traditional diagnostic methods for TB, such as sputum tests and chest X-rays, can be uncomfortable or even distressing for some patients. In contrast, the requirement is only to produce a cough. This non-invasiveness not only improves patient compliance but also reduces the risk of complications associated with more invasive procedures.

The cost-effectiveness of this method is also improved by its deployment via a mobile app. This approach drastically reduces the need for establishing and maintaining expensive infrastructure, such as specialised diagnostic centres and equipment. Consequently, the overall expenses related to TB screening are significantly lowered, making it a viable option for large-scale public health initiatives. The low cost of implementation could be especially advantageous for governments and non-profit organisations working in regions with high TB prevalence but limited financial resources.

Moreover, there is a potential for integration with existing digital health systems which would improve this test further. The app-based nature of this diagnostic tool allows for seamless incorporation into telemedicine platforms and electronic health records, facilitating better tracking and management of TB cases. Such integration could enable real-time data collection and analysis, providing valuable insights into the spread of the disease and the effectiveness of public health interventions.

Disadvantages of Analysing the Patient’s Cough

While this AI-assisted diagnostic method has significant promise, there are limitations and challenges to consider. Technology dependency is a primary concern, as this approach relies on patients having access to smartphones and a stable internet connection—resources that might be scarce in remote or economically disadvantaged areas. This digital divide could limit the method’s reach and effectiveness, particularly in the very regions where tuberculosis is most prevalent. Addressing this gap might involve developing offline functionality or creating alternative low-tech solutions to complement the AI model.

Audio quality issues also pose a challenge. Factors such as background noise, poor microphone quality, and varied recording environments can negatively impact the accuracy of the diagnosis. Although the developers have incorporated data augmentation techniques to help the model adapt to real-world conditions, including noisy or inconsistent audio inputs, there remains a risk that recordings taken in uncontrolled environments could compromise the tool’s effectiveness. Continued refinement and testing in diverse settings are essential to ensure reliability across different scenarios.

Additionally, the scope of this diagnostic tool is relatively limited. It is primarily designed for initial screening and diagnosis, without the capacity to monitor disease progression, assess treatment efficacy, or detect drug-resistant strains of TB. To provide comprehensive care, it would need to be integrated with traditional diagnostic methods and regular clinical follow-ups. Exploring ways to expand the tool’s capabilities or developing complementary technologies could address this shortfall, but it would require significant additional research and development.

Lastly, there is the issue of potential bias in the model’s performance across different populations. While the dataset includes over 700,000 cough samples from seven countries, it’s not clear whether it fully represents the global diversity needed for consistent accuracy across various demographic and geographic groups. Differences in cough characteristics due to cultural, genetic, or environmental factors could influence how well the model performs in different regions. This isn’t explicitly discussed in the article so it is only an assumption – it is possible that the model was trained on samples from seven diverse countries.

My Stance On This Test

The test has incredible promise and could revolutionise the diagnostics process of TB. But the test likely needs confirmation from a pathologist to be important. As the test has a higher sensitivity than specificity, any negative results should be checked by a pathologist to be sure.

Thanks for reading!

– Hamd Waseem

Hamd Waseem
Hamd Waseem
Articles: 5