The AI Commons Project is a proof of concept of a new methodology of developing Artificial Intelligence solutions that allows anyone, anywhere to benefit from the possibilities that AI can provide. The project aims to increase/improve the accessibility, reproducibility, contextualization and enhancement of Artificial Intelligence solutions globally and especially in emerging markets.
The project aims to demonstrate how a global community of AI experts can learn and co-create mutually beneficial solutions with the opportunity for cross-county incremental enhancement.
Data Science Nigeria
Rising Odegua (Data Science Nigeria)
Nelson Ogbeide; Ojeabulu O. Gift; Comfort Igboko; Caleb Emelike; Precious Cadeton
Pneumonia is an infection that affects one or both lungs and it is known to cause premature death around the world. World Health Organization(WHO) estimated that about 4 million death occur annually from air pollution diseases including pneumonia
Research has shown that children and people older than 65 are the most vulnerable to the disease and continents with a high number of adult and children ratio like Africa and Asia are mostly affected.
A doctor examines the chest x ray of a paitent to determine if the patient has pneumonia
Yes
Malaria detection using machine learning
Cancer detection using machine learning
The solution is a pneumonia detection model. It helps health practitioners in hospitals, clinics and laboratories to quickly detect whether a patient has pneumonia, given the patient’s chest X-ray. The first and only release ofthe solution was in 2019.
See paper Here
The outcome of the solution is a prediction of whether the patient has pneumonia or not if a chest X-ray is provided as input
Health practitioners
Patients
A data scientist/ machine learning engineer is needed to build the prediction model and a domain expert (such as a thoracic surgeon) is needed to provide technical details on how the diseases work and are being identified
Small amount of training data
More data should be used for training and evaluation
No. We hope to keep it updated whenever the solution is retrained.
A simple use case: When a patient shows symptoms of pneumonia, her chest xray is fed into the model to predict in seconds the likelihood of the patient having pneumonia. With this, doctors can help recommend drugs that could help treat the disease at early stage.
Health practitioners
The chest x ray image which is the input is uploaded into the model by a medical practitioner and the prediction of whether the patient has pneumonia or not is displayed on the screen.
N/A.
Chest X-Ray Images (Pneumonia)
The dataset contains 5,863 images in 2 categories namely Normal(Chest x ray) and infected (Chest x ray). The dataset is hosted on kaggle and maintained by the owner Paul Mooney and it does not have a datasheet.
Link to dataset: Click Here
Model date: 2019. The model was built using keras, a neural network framework in python for deep learning applications. The type of neural network is a 7-level convolutional neural network.
Basic image resize was performed on the data
No.
The dataset was split in the ratio 80:10:10 (train, validation, and test dataset). The performance metric used is mean average precision (mAP), it is a very important metric to use when building an object detection model.
The model was able to predict 95% of the test data correctly and 5% wrongly. The model accuracy, precision, recall, f1_score are 94%, 93%, 100% and 96% respectively.
40 epochs in 217 steps
The measure of statistics utlized is the classification report available in scikit-learn (python library) which showed the precision, recall , accuracy and f1_score of the model
On training the model, there was an error drop from 100% to 7%.
The average runtime is about 1-2s to predicting each sample result.
Operating system is linux, programming languuge is python3.7, keras version 1.5
Yes. Django, a python framework was used
The possible sources of bias/ unfairness weren’t analyzed.
The solution output are not easily explainable
The solution was not tested on different unseen data apart from the one used to develop the solution
It is not suitable for any solution that is not related to pneumonia.
The solution does not collect user data.
Copyright © 2020 Data Science Nigeria.