Locations:
Search IconSearch
November 6, 2024/Digestive/Research

Model Uses Machine Learning to Predict Patients at Risk of Gastric Cancer

Promising results could lead to improved screening, better outcomes

Physician speaking with patient by computer

Gastric cancer is relatively uncommon, but it is associated with poor survival and disproportionately affects all minority groups. Although not enough people are affected by gastric cancer in the U.S. to make routine screening practical, early detection significantly improves survival. As a result, being able to identify high-risk patients who would benefit from early screening could dramatically improve survival rates while simultaneously ensuring that screening resources are being used efficiently

Advertisement

Cleveland Clinic is a non-profit academic medical center. Advertising on our site helps support our mission. We do not endorse non-Cleveland Clinic products or services. Policy

New research from Cleveland Clinic describes how a machine learning model could help clinicians better identify these high-risk patients. The model, which used data from electronic health records (EHR), predicted patient risk of gastric cancer with surprising accuracy, says senior author, Michelle Kang Kim, MD, PhD, Chair of the Gastroenterology, Hepatology & Nutrition Department at Cleveland Clinic.

A need for better screening

In the United States, gastric cancer is only assessed for if the patient is having symptoms like abdominal pain or vomiting blood. The disease typically has poor outcomes with an overall 30% survival rate. However, in Asia, where the condition is more common, gastric cancer is screened for routinely, and survival rates are much higher, notes Dr. Kim.

“When you catch it early, the 5-year survival is as high as 95%,” she says, “so there’s real value to screening and detecting this cancer early when we can.”

Dr. Kim adds that improving detection would also address a severe healthcare disparity since gastric cancer disproportionately affects immigrant and minority populations in the U.S.

Study design

With the study, the research group wanted to know if they could use EHR information to identify patients who would benefit from screening. They used machine learning to develop a model that could predict which patients were most at risk.

The strongest model achieved an Area Under the Receiver Operating Characteristic Curve (AUC ROC) score of 0.78. More than 11,000 Ohio-based patients were included in the development set, including 567 who had gastric cancer. Notably, the model did not include Helicobacter pylori (even though it is a known risk factor) because this data was missing as patients are not tested for it routinely.

Advertisement

“To our surprise, the model performed pretty well, even without such an important variable,” says Dr. Kim.

Even more surprising, researchers were able to validate the model using data from Cleveland Clinic patients in Florida.

“We were thrilled with that result because you’re not often able to see a model that has equal performance in a different population from where it was developed,” she says.

Looking ahead

Dr. Kim notes that the model was based only on “relatively crude” demographic data like age, sex and race as well as clinical variables like medical history and laboratory values.

“This demonstrated that relatively simple variables can still give you a reasonably performing model, and I think that’s something that deserves further exploration,” she explains. “In this model, those factors such as age were actually some of the most important ones.”

The group plans to continue refining the model, including working with other medical centers in different regions to expand the patient population.

“Ultimately, one of our biggest takeaways from the study is that this approach is translatable, and similar models could help improve screening for other diseases,” says Dr. Kim. “Our model shows how clinical data can guide screening recommendations for high-risk groups. If we can do this with gastric cancer, we can take this same approach with other relatively uncommon diseases.”

Dr. Kim presented the project, “Development and External Validation of a Machine Learning-Based Gastric Cancer Prediction Model using Electronic Health Record Data,” at the 2024 American College of Gastroenterology Annual Scientific Meeting.

Advertisement

Related Articles

Physician speaking with patient
May 20, 2025/Digestive/Research
Luminal GI Cancers Driving the Rise in Early-Onset GI Cancer Rates

Greater awareness among young patients is needed

Physician with RFA device
May 19, 2025/Digestive/Research
New Approach Reduces Mortality, Complications in Malignant Biliary Obstruction

Combining RFA with stenting improves survival rates, reduces post-ERCP complications

Dr. Liska speaking with researcher
May 14, 2025/Digestive/Research
Rectal Cancer Patients with IBD Do Not Respond as Well to Neoadjuvant Therapy

Poor response may be due to different tumor biology

Dr. Gorgun in surgery
May 13, 2025/Digestive/Research
Rectal Cancer: Local Regrowth After TNT Does Not Increase Risk of Distant Metastasis

Study sheds light on distant metastasis rates between upfront surgery and Watch & Wait strategies

Dr. Holubar in surgery
May 12, 2025/Digestive/Research
New Study Highlights LMWH's Role in Lowering VTE Incidence After IBD Surgery

Findings help close the knowledge gap around VTE practice patterns

Blood being drawn
May 6, 2025/Digestive/Research
Study Suggests Non-Biopsy Diagnosis May Be Accurate for a Select Group of Adults with Celiac Disease

Potential for non-invasive methods to transform adult celiac disease diagnoses

May 5, 2025/Digestive/Research
Bariatric Surgery Reduces CKD Risk in MASH/MASLD Patients with Obesity and Type 2 Diabetes

Consider each patient's unique disease progression and treatment goals when choosing a strategy

Ad