Dried fish dataset for Indian seafood: A machine learning application

被引:4
|
作者
Paygude, Priyanka [1 ]
Gayakwad, Milind [1 ]
Wategaonkar, Dhanashri [2 ]
Pawar, Rajendra [3 ]
Pujeri, Ramchandra [3 ]
Joshi, Rahul [4 ]
机构
[1] Bharati Vidyapeeth, Coll Engn, Pune, India
[2] Dr Vishwanath Karad MIT World Peace Univ, Dept Comp Engn & Technol, Pune, India
[3] MIT Art Design & Technol Univ, MIT Sch Comp, Pune, India
[4] Symbiosis Int, Symbiosis Inst Technol, Pune, India
来源
DATA IN BRIEF | 2024年 / 55卷
关键词
Dried fish classification; Dried fish dataset; Dried fish detection; Machine learning;
D O I
10.1016/j.dib.2024.110563
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Dryingfish is a simple and economical way to process the catch. It creates a profitable business for coastal communities by providing a market for their catches, even during periods of abundance. It's a traditional method to preserve fish, especially valuable in regions where fresh fish isn't readily available or affordable throughout the year. This dataset provides a rich resource of 8290 images specifically designed for machine learning applications. It focuses on the five most popular types of dried seafood in India: prawns (shrimp), small anchovies (tingali), golden anchovies (mandeli), mackerel (bangada), and Bombay duck (bombil). To ensure high-quality data for machine learning applications for Identification and classification of different dried fish varieties, the dataset features a diverse set of images in singles and in bulk for each category. The dataset utilizes standardized lighting, background, and object pose for optimal machine learning performance. This rich dataset empowers researchers and data scientists to leverage machine learning for various applications in the Indian dried fish industry.Overall, the Dried Fish Dataset for Indian Seafood aims to leverage machine learning to improve the standardization, quality control, safety, and efficiency of the Indian dried fish industry. (c) 2024 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
引用
收藏
页数:8
相关论文
共 50 条
  • [21] A benchmark dataset for machine learning in ecotoxicology
    Schuer, Christoph
    Gasser, Lilian
    Perez-Cruz, Fernando
    Schirmer, Kristin
    Baity-Jesi, Marco
    SCIENTIFIC DATA, 2023, 10 (01)
  • [22] CuneiML: A Cuneiform Dataset for Machine Learning
    Chen, Danlu
    Agarwal, Aditi
    Berg-Kirkpatrick, Taylor
    Myerston, Jacobo
    JOURNAL OF OPEN HUMANITIES DATA, 2023, 9
  • [23] Predicting the Unpredictable: An Application of Machine Learning Algorithms in Indian Stock Market
    Saini A.
    Sharma A.
    Annals of Data Science, 2022, 9 (04) : 791 - 799
  • [24] Dried Fish Classification Using Deep Learning
    Laboni, Marfi Akter
    Rimi, Iffat Firozy
    Deb, Shrabosty
    Afrin, Farhina
    Hena, Most Hasna
    2022 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, COMPUTING, COMMUNICATION AND SUSTAINABLE TECHNOLOGIES (ICAECT), 2022,
  • [25] Dataset Generation Methodology: Towards Application of Machine Learning in Industrial Water Treatment Security
    Novikova E.
    Fedorchenko E.
    Danilov A.
    Saenko I.
    SN Computer Science, 5 (4)
  • [26] Accelerating the Design of Photocatalytic Surfaces for Antimicrobial Application: Machine Learning Based on a Sparse Dataset
    Park, Heesoo
    Bentria, El Tayeb
    Rtimi, Sami
    Arredouani, Abdelilah
    Bensmail, Halima
    El-Mellouhi, Fedwa
    CATALYSTS, 2021, 11 (08)
  • [27] Application of machine learning and complex network measures to an EEG dataset from ayahuasca experiments
    Alves, Caroline L.
    Cury, Rubens Gisbert
    Roster, Kirstin
    Pineda, Aruane M.
    Rodrigues, Francisco A.
    Thielemann, Christiane
    Ciba, Manuel
    PLOS ONE, 2022, 17 (12):
  • [28] Predictive modeling of water quality index (WQI) classes in Indian rivers: Insights from the application of multiple Machine Learning (ML) models on a decennial dataset
    Singh, Shailja
    Das, Anirban
    Sharma, Paawan
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2024, 38 (08) : 3221 - 3238
  • [29] Application of Big Data Analytics and Machine Learning to Large-Scale Synchrophasor Datasets: Evaluation of Dataset 'Machine Learning-Readiness'
    Hart, Philip
    He, Lijun
    Wang, Tianyi
    Kumar, Vijay S.
    Aggour, Kareem
    Subramanian, Arun
    Yan, Weizhong
    IEEE OPEN ACCESS JOURNAL OF POWER AND ENERGY, 2022, 9 : 386 - 397
  • [30] DescribeML: A dataset description tool for machine learning
    Giner-Miguelez, Joan
    Gomez, Abel
    Cabot, Jordi
    SCIENCE OF COMPUTER PROGRAMMING, 2024, 231