OpenCrystalData: An open-access particle image database to facilitate learning, experimentation, and development of image analysis models for crystallization processes

被引:3
|
作者
Barhate, Yash [1 ]
Boyle, Christopher [2 ,3 ]
Salami, Hossein [4 ]
Wu, Wei-Lee [1 ]
Taherimakhsousi, Nina [5 ]
Rabinowitz, Charlie [5 ]
Bommarius, Andreas [4 ]
Cardona, Javier [2 ,3 ,6 ]
Nagy, Zoltan K. [1 ]
Rousseau, Ronald [4 ]
Grover, Martha [4 ]
机构
[1] Purdue Univ, Davidson Sch Chem Engn, W Lafayette, IN USA
[2] Univ Strathclyde, CMAC EPSRC Future Mfg Res Hub, Glasgow City, Scotland
[3] Univ Strathclyde, Dept Chem & Proc Engn, Glasgow City, Scotland
[4] Georgia Inst Technol, Sch Chem & Biomol Engn, Atlanta, GA 30332 USA
[5] Mettler Toledo AutoChem Inc, Columbia, MD 21046 USA
[6] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow City, Scotland
来源
基金
英国工程与自然科学研究理事会;
关键词
Crystallization; Process analytical technology; Imaging; Open-access database; Machine learning; PROCESS ANALYTICAL TECHNOLOGY; SIZE DISTRIBUTION; SENSOR;
D O I
10.1016/j.dche.2024.100150
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Imaging and image-based process analytical technologies (PAT) have revolutionized the design, development, and operation of crystallization processes, providing greater process understanding through the characterization of particle size, shape and crystallization mechanisms in real -time. The performance of corresponding PAT models, including machine learning/artificial intelligence (ML/AI)-based approaches, is highly reliant on the data quality used for training or validation. However, acquiring high quality data is often time consuming and a major roadblock in developing image analysis models for crystallization processes. To address the lack of diverse, high-quality, and publicly available particle image datasets, this paper presents an initiative to create an open-access crystallization-related image database: OpenCrystalData (OCD, at www.ka ggle.com/opencrystaldata/datasets). The datasets consist of images from different crystallization systems with different particle sizes and shapes captured under various conditions. The initial release consists of four different datasets, addressing the estimation of particle size distribution using in -situ images for different categories of particles and detection of anomalous particles for process monitoring purposes. Images are collected using various instruments, followed by case-specific processing steps, such as ground-truth labeling and particle size characterization using offline microscopy. Datasets are released on the online collaborative platform Kaggle, along with specific guidelines for each dataset. These datasets are aimed to serve as a resource for researchers to enable learning, experimentation, development, and evaluation and comparison of different analytical approaches and algorithms. Another goal of this initiative is to encourage researchers to contribute new datasets focusing on various systems and problem statements. Ultimately, OpenCrystalData is intended to facilitate and inspire new developments in imaging-based PAT for crystallization processes, encouraging a shift from timeconsuming offline analysis towards comprehensive real -time process insights that drive product quality.
引用
收藏
页数:7
相关论文
共 11 条
  • [1] An open-access breast lesion ultrasound image database: Applicable in artificial intelligence studies
    Ardakani, Ali Abbasian
    Mohammadi, Afshin
    Mirza-Aghazadeh-Attari, Mohammad
    Acharya, U. Rajendra
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
  • [2] In situ particle size estimation for crystallization processes by multivariate image analysis
    Sarkar, Debasis
    Doan, Xuan-Tien
    Ying, Zhou
    Srinivasan, Rajagopalan
    CHEMICAL ENGINEERING SCIENCE, 2009, 64 (01) : 9 - 19
  • [3] A verified open-access AI-based chemical microparticle image database for in-situ particle visualization and quantification in multi-phase flow
    Liu, Jian
    Zhang, Qingyang
    Chen, Mingyang
    Gao, Zhenguo
    Rohani, Sohrab
    Gong, Junbo
    CHEMICAL ENGINEERING JOURNAL, 2023, 451
  • [4] Open-access database for digital lenslessholographic microscopy and its application on theimprovement of deep-learning-basedautofocusing models
    Buitrago-Duque, Carlos
    Tobon-Maya, Heberley
    Gomez-Ramirez, Alejandra
    Zapata-valencia, Samuel I.
    Lopera, Maria J.
    Trujillo, Carlos
    Garcia-Sucerquia, Jorge
    APPLIED OPTICS, 2024, 63 (07) : B49 - B58
  • [5] The Wood Image Analysis and Dataset (WIAD): Open-access visual analysis tools to advance the ecological data revolution
    Rademacher, Tim
    Seyednasrollah, Bijan
    Basler, David J.
    Cheng, Jian
    Mandra, Tessa
    Miller, Elise
    Lin, Zuid
    Orwig, David A.
    Pederson, Neil
    Pfister, Hanspeter
    Wei, Donglai
    Yao, Li
    Richardson, Andrew D.
    METHODS IN ECOLOGY AND EVOLUTION, 2021, 12 (12): : 2379 - 2387
  • [6] Comparing Open-Access Database and Traditional Intensive Care Studies Using Machine Learning: Bibliometric Analysis Study
    Ke, Yuhe
    Yang, Rui
    Liu, Nan
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [7] Novel image analysis method for in situ monitoring the particle size distribution of batch crystallization processes
    Presles, Benoit
    Debayle, Johan
    Fevotte, Gilles
    Pinoli, Jean-Charles
    JOURNAL OF ELECTRONIC IMAGING, 2010, 19 (03)
  • [8] Deep learning-based on-line image analysis for continuous industrial crystallization processes
    Zong, Shiliang
    Zhou, Guangzheng
    Li, Meng
    Wang, Xuezhong
    PARTICUOLOGY, 2023, 74 : 173 - 183
  • [9] Deep Learning-Based Binocular Image Analysis for In Situ Measurement of Particle Length Distribution During Crystallization Process
    Fan, Ji
    Liu, Tao
    Shuang, Yongcan
    Song, Bo
    Chen, Junghui
    Tan, Yonghong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72 : 1 - 14
  • [10] An open-access computer image analysis (CIA) method to predict meat and fat content from an android smartphone-derived picture of the bovine 5th-6th rib
    Meunier, Bruno
    Normand, Jerome
    Albouy-Kissi, Benjamin
    Micol, Didier
    El Jabri, Mohammed
    Bonnet, Muriel
    METHODS, 2021, 186 : 79 - 89