A machine learning approach to identifying Salmonella stress response genes in isolates from poultry processing

Abstract:
We explored the potential of machine learning to identify significant genes associated with the Salmonella stress response during poultry processing, using whole genome sequencing (WGS) data. The Salmonella isolates (n = 177) used in this study were obtained from various chicken sources (skin before chiller, chicken carcass before chiller, frozen chicken, and post-chill chicken carcass). Six machine learning algorithms (random forest, neural network, cost-sensitive learning, logit boost, and support vector machine with linear and radial kernels) were trained on the Salmonella WGS data, and model fit was assessed using standard evaluation metrics such as the area under the receiver operating characteristic curve (AUROC) and confusion matrix statistics. All models achieved high performance on the AUROC metric; logit boost performed best, with an AUROC of 0.904, a sensitivity of 0.889, and a specificity of 0.920. The significant genes identified included ybtX, which encodes a yersiniabactin-associated zinc transporter, and the transferase-encoding genes yccK and thiS. Additionally, genes coding for cold shock (cspA, cspD, and cspE) and heat shock (rpoH and rpoE) responses were identified. Other significant genes included those involved in lipopolysaccharide biosynthesis (irp1, waaD, rfc, and rfbX), DNA repair and replication (traI), biofilm formation (ccdA and fyuA), and cellular metabolism (irtA).
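The evaluation metrics named in the abstract (AUROC, sensitivity, specificity) can be illustrated with a minimal Python sketch. This is not the authors' code: the labels, scores, and the 0.5 decision threshold below are hypothetical toy values, and the helper names `sensitivity_specificity` and `auroc` are introduced here purely for illustration. AUROC is computed in its pairwise form, as the probability that a randomly chosen positive example scores higher than a randomly chosen negative one.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels coded 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    return tp / (tp + fn), tn / (tn + fp)


def auroc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the fraction of (positive, negative) pairs ranked correctly.

    Tied scores count as half a correctly ranked pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, not taken from the study.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
# Assumed decision threshold of 0.5 for turning scores into class labels.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

sens, spec = sensitivity_specificity(y_true, y_pred)
area = auroc(y_true, scores)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} AUROC={area:.3f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas AUROC summarizes ranking quality across all thresholds, which is presumably why the abstract reports all three.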
Keywords: machine learning; Salmonella; stress response; whole genome sequencing; poultry processing
Authors: Benefo, Edmund O; Karanth, Shraddha; Pradhan, Abani K
Affiliation: Univ Maryland
Journal: FOOD RESEARCH INTERNATIONAL
Journal impact factor: 0.0
Publication year: 2024
Volume: 175
Full-text delivery request: 江苏省科技资源(工程技术文献)统筹服务平台

  1. Compilation service: Smart Agriculture
  2. Compiled by: 虞德容
  3. Compilation date: 2025-04-07