Articles

  1. As a means of building explainable machine learning models for Big Data, we apply a novel ensemble supervised feature selection technique. The technique is applied to publicly available insurance claims data f...

    Authors: John T. Hancock, Richard A. Bauder, Huanjing Wang and Taghi M. Khoshgoftaar
    Citation: Journal of Big Data 2023 10:154
  2. Under-sampling is a technique for overcoming the class imbalance problem; however, selecting the instances to be dropped and measuring their informativeness is an important concern. This paper tries to bring up a ne...

    Authors: Tayyebe Feizi, Mohammad Hossein Moattar and Hamid Tabatabaee
    Citation: Journal of Big Data 2023 10:153
  3. Pollen identification is necessary for several subfields of geology, ecology, and evolutionary biology. However, the existing methods for pollen identification are laborious, time-consuming, and require highly...

    Authors: Masoud A. Rostami, Behnaz Balmaki, Lee A. Dyer, Julie M. Allen, Mohamed F. Sallam and Fabrizio Frontalini
    Citation: Journal of Big Data 2023 10:151
  4. Many agencies and organizations, such as the U.S. Geological Survey, handle massive geospatial datasets and their auxiliary data and are thus faced with challenges in storing data and ingesting it, transferrin...

    Authors: S. T. Arundel, K. G. McKeehan, B. B. Campbell, A. N. Bulen and P. T. Thiem
    Citation: Journal of Big Data 2023 10:146
  5. Automated detection of defects on metal surfaces is crucial for ensuring quality control. However, the scarcity of labeled datasets for emerging target defects poses a significant obstacle. This study proposes...

    Authors: Mahe Zabin, Anika Nahian Binte Kabir, Muhammad Khubayeeb Kabir, Ho-Jin Choi and Jia Uddin
    Citation: Journal of Big Data 2023 10:145
  6. The identification and prognosis of the potential for developing Cardiovascular Diseases (CVD) in healthy individuals is a vital aspect of disease management. Accessing the comprehensive health data on CVD cur...

    Authors: Nadiah A. Baghdadi, Sally Mohammed Farghaly Abdelaliem, Amer Malki, Ibrahim Gad, Ashraf Ewis and Elsayed Atlam
    Citation: Journal of Big Data 2023 10:144
  7. Although Parkinson’s disease (PD) has a heterogeneous disease course, it remains challenging to establish subtypes. We described and clustered the natural course of Parkinson’s disease (PD) with respect to fun...

    Authors: Dougho Park, Su Yun Lee, Jong Hun Kim and Hyoung Seop Kim
    Citation: Journal of Big Data 2023 10:140
  8. Fin-Tech, the merging of finance and technology, is a key term for technology-based financial operations and money transactions. In the massive field of business...

    Authors: Habib Ullah Khan, Muhammad Sohail, Shah Nazir, Tariq Hussain, Babar Shah and Farman Ali
    Citation: Journal of Big Data 2023 10:138
  9. Large unbalanced datasets pose challenges for machine learning models, as redundant and irrelevant features can hinder their effectiveness. Furthermore, the performance of intrusion detection systems (IDS) can...

    Authors: Kezhou Ren, Yifan Zeng, Yuanfu Zhong, Biao Sheng and Yingchao Zhang
    Citation: Journal of Big Data 2023 10:137
  10. Real-time object tracking and occlusion handling are critical research areas in computer vision and machine learning. Developing an efficient and accurate object-tracking method that can operate in real-time w...

    Authors: Devira Anggi Maharani, Carmadi Machbub, Lenni Yulianti and Pranoto Hidaya Rusmin
    Citation: Journal of Big Data 2023 10:136
  11. Screening for hyperthyroidism using gold-standard diagnostic criteria in the general population is not cost-effective, leading to a relatively high rate of undiagnosed and untreated patients. This study aimed ...

    Authors: Li Dong, Lie Ju, Shiqi Hui, Lihua Luo, Xue Jiang, Zihan Nie, Ruiheng Zhang, Wenda Zhou, Heyan Li, Jost B. Jonas, Xin Wang, Xin Zhao, Chao He, Yuzhong Chen, Zhaohui Wang, Jianxiong Gao…
    Citation: Journal of Big Data 2023 10:134
  12. Triple-negative breast cancer (TNBC) is a relatively aggressive breast cancer subtype due to tumor relapse, drug resistance, and multi-organ metastatic properties. Identifying reliable biomarkers to predict pr...

    Authors: Shuyu Li, Nan Zhang, Hao Zhang, Ran Zhou, Zirui Li, Xue Yang, Wantao Wu, Hanning Li, Peng Luo, Zeyu Wang, Ziyu Dai, Xisong Liang, Jie Wen, Xun Zhang, Bo Zhang, Quan Cheng…
    Citation: Journal of Big Data 2023 10:132
  13. The existing Fisher’s exact test has been widely applied for investigating whether the difference between the observed frequencies is significant or not. The existing Fisher’s exact test can be applied only wh...

    Authors: Muhammad Aslam and Faten S. Alamri
    Citation: Journal of Big Data 2023 10:131
  14. Hepatocellular carcinoma (HCC) represents a formidable malignancy with a high lethality. Nonetheless, the development of vaccine and the establishment of prognostic models for precise and personalized treatmen...

    Authors: Zhiyuan Zheng, Hantao Yang, Yang Shi, Feng Zhou, Lingxiao Liu, Zhiping Yan and Xiaolin Wang
    Citation: Journal of Big Data 2023 10:129
  15. Internet of Things (IoT) driven systems have grown sharply in recent times, but this evolution is hampered by cybersecurity threats like spoofing, denial of service (DoS), distributed denial of servi...

    Authors: Yasir Ali, Habib Ullah Khan and Muhammad Khalid
    Citation: Journal of Big Data 2023 10:128
  16. Answering questions related to the legal domain is a complex task, primarily due to the intricate nature and diverse range of legal document systems. Providing an accurate answer to a legal query typically nec...

    Authors: Abdelrahman Abdallah, Bhawna Piryani and Adam Jatowt
    Citation: Journal of Big Data 2023 10:127
  17. The existing semi-average method under classical statistics is applied to measure the trend in the time series data. The existing semi-average method cannot be applied when the time series data is in intervals...

    Authors: Muhammad Aslam
    Citation: Journal of Big Data 2023 10:126
  18. Mobility data of a moving object, called trajectory data, are continuously generated by vessel navigation systems, wearable devices, and drones, to name a few. Trajectory data consist of samples that include t...

    Authors: Bakht Zaman, Dogan Altan, Dusica Marijan and Tetyana Kholodna
    Citation: Journal of Big Data 2023 10:123
  19. Keratitis is a major cause of corneal blindness worldwide. Early identification and timely treatment of keratitis can deter the disease progression, reaching a better prognosis. The diagnosis of keratitis ofte...

    Authors: Jiewei Jiang, Wei Liu, Mengjie Pei, Liufei Guo, Jingshi Yang, Chengchao Wu, Jiaojiao Lu, Ruijie Gao, Wei Chen, Jiamin Gong, Mingmin Zhu and Zhongwen Li
    Citation: Journal of Big Data 2023 10:121
  20. k-means, one of the most widely used clustering algorithms, is not only faster in computation but also produces comparatively better clusters. However, it has two major downsides: first, it is sensitive to i...

    Authors: Marina Gul and M. Abdul Rehman
    Citation: Journal of Big Data 2023 10:120
  21. Users on social networks such as Twitter interact with each other without much knowledge of the real identity behind the accounts they interact with. This anonymity has created a perfect environment for bot ac...

    Authors: Ashkan Dehghan, Kinga Siuta, Agata Skorupka, Akshat Dubey, Andrei Betlen, David Miller, Wei Xu, Bogumił Kamiński and Paweł Prałat
    Citation: Journal of Big Data 2023 10:119
  22. Neurological diseases are on the rise worldwide, leading to increased healthcare costs and diminished quality of life in patients. In recent years, Big Data has started to transform the fields of Neuroscience ...

    Authors: Laura Dipietro, Paola Gonzalez-Mego, Ciro Ramos-Estebanez, Lauren Hana Zukowski, Rahul Mikkilineni, Richard Jarrett Rushmore and Timothy Wagner
    Citation: Journal of Big Data 2023 10:116
  23. With the development of computer vision technology, the demand for deploying vision inspection tasks on edge mobile devices is becoming increasingly widespread. To meet the requirements of application scenario...

    Authors: Hu Gang, Sheng Guanglei, Wang Xiaofeng and Jiang Jinlin
    Citation: Journal of Big Data 2023 10:114
  24. Optical coherence tomography angiography (OCTA) has been a frequently used diagnostic method in neovascular age-related macular degeneration (nAMD) because it is non-invasive and provides a comprehensive view ...

    Authors: Wei Feng, Meihan Duan, Bingjie Wang, Yu Du, Yiran Zhao, Bin Wang, Lin Zhao, Zongyuan Ge and Yuntao Hu
    Citation: Journal of Big Data 2023 10:111
  25. Fraud datasets often lack consistent and accurate labels, and are characterized by high class imbalance, where fraudulent examples are far fewer than normal ones. Machine lea...

    Authors: Robert K. L. Kennedy, Zahra Salekshahrezaee, Flavio Villanustre and Taghi M. Khoshgoftaar
    Citation: Journal of Big Data 2023 10:106

Annual Journal Metrics

  • 2022 Citation Impact
    8.1 - 2-year Impact Factor
    5.095 - SNIP (Source Normalized Impact per Paper)
    2.714 - SJR (SCImago Journal Rank)

  • 2023 Speed
    56 days submission to first editorial decision for all manuscripts (Median)
    205 days submission to accept (Median)

  • 2023 Usage
    2,559,548 downloads
    280 Altmetric mentions