with the collaboration of Iranian Food Science and Technology Association (IFSTA)

Document Type : Research Article-en

Authors

1 Department of Information Technology Management, Faculty of Management and Economics, Tarbiat Modares University, Tehran, Iran

2 Department of Biosystems Mechanical Engineering, Faculty of Agriculture, Shahrekord University, Shahrekord, Iran

10.22067/ifstrj.2024.89599.1360

Abstract

Electronic nose is an electronic device for smell detection. The data obtained from this device are stored in the form of numbers in different columns, which are related to the data of two types of cheese namely gluten-free cheese and cheese with gluten. It is  not enough to make decisions and judge the data unless discovering the relationships and patterns between the data obtained to determine the relation of new data recorded by the device to the type of cheese, For this purpose, data mining and machine learning methods have been used in this research. Data mining includes various algorithms such as classification, clustering, and obtaining association rules. To get a better result from the data, a data mining process was performed on 105 different permutations of the models, and 13 models with the highest accuracy in understanding the relationships between the data were chosen. In this research, with data mining methods, cheese with gluten and gluten-free cheese data were classified into separate categories, and a model was created to predict the type of new input data in terms of the nature of cheese (gluten-free and with gluten). With analyzing 105 Permutations, Finally, the best suitable model to be used for data classification using the Random Forest algorithm and MinMaxScaler for scaling was selected with a prediction accuracy of 99.8% for both test and training datasets.

Keywords

Main Subjects

©2025 The author(s). This is an open access article distributed under Creative Commons Attribution 4.0 International License (CC BY 4.0)

  1. Criminisi, J.S. (2011). Decision forests: A unified framework for classification, regression, density estimation, manifold learning and semi-supervised learning. Foundations and Trends⃝R in Computer Graphics and Vision, 7(2-3), 81-227. https://doi.org/10.1561/0600000035
  2. Bhattacharya, N.T. (2008). Preemptive identification of optimum fermentation time for black tea using electronic nose. Sensors and Actuators B: Chemical, 131(1), 110-116. https://doi.org/10.1016/j.snb.2007.12.032
  3. Biau, G. (2016). A random forest guided tour. Test 25.2, 197-227. https://doi.org/10.1007/s11749-016-0481-7
  4. Breiman, L. (2001). Random forests. Machine Learning, 45:5–32.
  5. Breiman, L. (2001). Random forests. Mach. Learn.
  6. Du, W. (2002). Building decision tree classifier on private data.
  7. Fernandez, L.Z. (2023). Applications of electronic noses in cheese quality assessment. Journal of Food Science and Technology, 60(3), 1234-1245.
  8. Freund, Y.S. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119-139. https://doi.org/10.1006/jcss.1997.1504
  9. Shekari, E.M. (2024). Evaluation of the quantitative and qualitative characteristics of gluten-free chicken nuggets containing quinoa flour and hydroxypropyl methyl cellulose (HPMC). (HPMC). Iranian Food Science & Technology Research Journal/Majallah-i Pizhūhishhā-yi ̒Ulūm va Sanāyi̒-i Ghaz̠āyī-i Īrān, 20(1), 47-62.
  10. Hu, W.H. (2008). Adaboost-based algorithm for network intrusion detection. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 38(2), 577-583. https://doi.org/10.1109/TSMCB.2007.914695
  11. Karoui, R. (2011). Fluorescence spectroscopy measurement for quality assessment of food systems—a review. Food and Bioprocess Technology, 4, 364-386. https://doi.org/10.1007/s11947-010-0370-0
  12. Persaud, K., & Dodd, G. (1982). Analysis of discrimination mechanisms in the mammalian olfactory system using a model nose. Nature, 299, 352-355. https://doi.org/10.1038/299352a0
  13. Qi, Y. (2012). Random forest for bioinformatics. Ensemble machine learning. Springer, Boston, MA, 307-323. https://doi.org/10.1007/978-1-4419-9326-7_11
  14. Quinlan, J.R. (1993). 5, Programs for Machine Learning. Morgan Kaufmann San Mateo Ca.
  15. Ren, S.C. (2015). Global refinement of random forest. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
  16. Schapire, R.E. (197-227). The strength of weak learnability. Machine Learning, 5(2), 1990. https://doi.org/10.1007/BF00116037
  17. Schapire, R.E. (2013). Explaining adaboost. In: Empirical inference. Springer, Berlin, Heidelberg, p. 37-52.
  18. Stein, G.C. (2005). Decision tree classifier for network intrusion detection with GA-based feature selection. Proceedings of the 43rd annual Southeast regional conference-Volume 2. https://doi.org/10.1145/1167253.1167288
  19. Thompson, T.S. (2023). Advances in gluten detection methods for celiac disease management. Nutrients, 15(2), 789.
  20. Vapnik, V. (1998). Statistical Learning Theory. John Wiley and Sons.
  21. Wang, F.L. (2019). Feature learning viewpoint of AdaBoost and a new algorithm. IEEE Access, 7, 149890-149899. https://doi.org/10.1109/ACCESS.2019.2947359
  22. Wilson, A.D. (2009). Applications and advances in electronic-nose technologies. Sensors, 9(7), 5099-5148. https://doi.org/10.3390/s90705099
  23. Wilson, A.D. (2013). Diverse applications of electronic-nose technologies in agriculture and forestry. Sensors, 13(2), 2295-2348. https://doi.org/10.3390/s130202295
  24. Yu, H.L. (2024). Rapid detection of gluten contamination in food products using advanced sensor technologies. Food Chemistry, 420, 136042.
  25. Zhang, Y.C. (2023). Data mining approaches in electronic nose technology for food quality control. Trends in Food Science & Technology, 135, 245-258.
  26. Zhao, X.L. (2024). Emerging sensor-based technologies for food safety and quality monitoring. Sensors and Actuators B: Chemical, 389, 134934.
CAPTCHA Image