pyBioPortal: a Python package for simplifying cBioPortal data access in cancer research

Authors
Valerio, Matteo [1]
Inno, Alessandro [1]
Gori, Stefania [1]
Affiliations
[1] IRCCS Sacro Cuore Don Calabria Hosp, Med Oncol, Via Don A Sempreboni 5, I-37024 Verona, Negrar di Valpo, Italy
Keywords
cBioPortal; cancer research; bioinformatics; Python
DOI
10.1093/jamiaopen/ooae146
Chinese Library Classification
R19 [Health care organizations and services (health services administration)]
Abstract
Objectives: In recent years, the rise of big data and artificial intelligence has led to an increasing expansion of databases and web services in biomedical research. cBioPortal is one of the most widely used platforms for accessing cancer genomic and clinical data. The primary objective of this study was to develop a tool that simplifies programmatic interaction with cBioPortal's web service.
Materials and Methods: We developed the pyBioPortal Python package, which leverages the cBioPortal REST API to access genomic and clinical data. The retrieved data is returned as a Pandas DataFrame, a format widely used for data analysis in Python.
Results: pyBioPortal offers an efficient interface between the user and the cBioPortal database. The data is provided in formats conducive to further analysis and visualization, promoting workflows and improving reproducibility.
Discussion: The development of pyBioPortal addresses the challenge of accessing and processing large volumes of biomedical data. By simplifying the interaction with the cBioPortal API and providing data in Pandas DataFrame format, pyBioPortal allows users to focus more on the analytical aspects rather than data extraction.
Conclusion: This tool facilitates the retrieval of heterogeneous biological and clinical data in a standardized format, making it more accessible for analysis and enhancing the reproducibility of results in cancer informatics. Distributed as an open-source project, pyBioPortal is available to the broader bioinformatics community, promoting collaboration and advancing research in cancer genomics.
The advent of big data and artificial intelligence has revolutionized cancer research, making large amounts of data available for analysis. However, accessing this data can be challenging, especially for researchers without specific programming skills or those who prefer to focus on data analysis and interpretation rather than on the technical aspects of data extraction.
pyBioPortal is a newly developed Python tool designed to simplify the process of retrieving data from cBioPortal, a widely used platform for accessing cancer genomic and clinical data. By making data easier to access and analyze, pyBioPortal enables researchers to focus on uncovering new insights that drive advancements in cancer research. The tool is freely available as an open-source project, promoting widespread use and collaboration within the bioinformatics community.
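The workflow described in the abstract (query the cBioPortal REST API, then hand the result to Pandas) can be illustrated with a minimal sketch. This is not pyBioPortal's own interface: the helper names `build_url`, `fetch_records`, and `to_dataframe` are hypothetical, and the `studies` endpoint and `pageSize` query parameter are assumptions based on the public cBioPortal instance rather than details given in this record.

```python
import json
from urllib.parse import urlencode, urljoin
from urllib.request import urlopen

# Assumption: the public cBioPortal instance; other deployments use other hosts.
BASE_URL = "https://www.cbioportal.org/api/"


def build_url(endpoint, **params):
    """Join the base URL, an endpoint path, and optional query parameters."""
    url = urljoin(BASE_URL, endpoint.lstrip("/"))
    query = urlencode({k: v for k, v in params.items() if v is not None})
    return f"{url}?{query}" if query else url


def fetch_records(endpoint, opener=urlopen, **params):
    """GET a JSON payload from the API and return the decoded Python object.

    `opener` is injectable so the function can be exercised without network access.
    """
    with opener(build_url(endpoint, **params)) as response:
        return json.loads(response.read().decode("utf-8"))


def to_dataframe(records):
    """Flatten a list of JSON records into a Pandas DataFrame."""
    # Imported lazily so the URL and JSON helpers work without pandas installed.
    import pandas as pd

    return pd.json_normalize(records)


if __name__ == "__main__":
    # Hypothetical usage: list a few studies and inspect them as a table.
    studies = to_dataframe(fetch_records("studies", pageSize=5))
    print(studies.head())
```

Keeping the HTTP call separate from the DataFrame conversion means the same conversion step can be reused for any endpoint that returns a list of JSON records.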
Pages: 5