Evolutionary undersampling for extremely imbalanced big data classification under Apache Spark
The classification of datasets with a skewed class distribution is an important problem in data mining. Evolutionary undersampling of the majority class has proved to be a successful approach to tackle this issue. Such a challenging task may become even more difficult when the number of the majority...
| Main Authors: | Triguero, Isaac, Galar, M., Merino, D., Maillo, Jesus, Bustince, H., Herrera, Francisco |
|---|---|
| Format: | Conference or Workshop Item |
| Published: | 2016 |
| Subjects: | |
| Online Access: | https://eprints.nottingham.ac.uk/38876/ |
Similar Items
kNN-IS: an iterative spark-based design of the k-nearest neighbors classifier for big data
by: Maillo, Jesus, et al.
Published: (2016)
A first attempt on global evolutionary undersampling for imbalanced big data
by: Triguero, Isaac, et al.
Published: (2017)
From Big data to Smart Data with the K-Nearest Neighbours algorithm
by: Triguero, Isaac, et al.
Published: (2016)
ROSEFW-RF: the winner algorithm for the ECBDL’14 big data competition: an extremely imbalanced big data bioinformatics problem
by: Triguero, Isaac, et al.
Published: (2015)
Unstructured big data processing in cloud computing environment by using Amazon Elastic Map Reduce
by: Busu, Norzaharawani
Published: (2017)
Local and global measures for measuring performance of big data analytics process
by: Ali, Ismail Mohamed
Published: (2019)
EPRENNID: An evolutionary prototype reduction based ensemble for nearest neighbor classification of imbalanced data
by: Vluymans, Sarah, et al.
Published: (2016)
MRPR: a MapReduce solution for prototype reduction in big data classification
by: Triguero, Isaac, et al.
Published: (2015)
Factors influencing user intention towards big data technology adoptions in educational organizations
by: Harun, Noor Baizura
Published: (2021)
Proportion of nonsteroidal anti-inflammatory drug prescription in equine practice
by: Marco, Duz, et al.
Published: (2018)
Knowledge grid to facilitate knowledge sharing model in big data community
by: Hosseinioun, Sara
Published: (2022)
Factors affecting successful big data analytics implementation in public sector of Malaysia
by: Adrian, Cecilia
Published: (2019)
Developments in the right to be forgotten
by: McGoldrick, Dominic
Published: (2013)
Robust spatial diagnostic method and parameter estimation for spatial big data regression model
by: Ali, Mohammed Baba
Published: (2022)
Intention to use big data technology in teaching among higher education educators in Yunnan, China
by: Wang, Qianhui
Published: (2022)
Verification of the HDM-4 fuel consumption model using a Big data approach: a UK case study
by: Perrotta, Federico, et al.
Published: (2019)
Democracy and human rights: concepts, measures, and relationships
by: Landman, Todd
Published: (2017)
A supervised adverse drug reaction signalling framework imitating Bradford Hill’s causality considerations
by: Reps, Jenna M., et al.
Published: (2015)
A framework for accelerated product innovation in a big data environment
by: Zhan, Yuanzhu
Published: (2017)
Using big data to make better decisions in the digital economy
by: Tan, Kim Hua, et al.
Published: (2017)
Sentiment analysis of hotel reviews in Singapore
by: Wong, Wai Ming
Published: (2020)
The North West Shelf (NWS), a Digital Petroleum Ecosystem (PDE) in a Big Data Scale
by: Nimmagadda, Shastri, et al.
Published: (2018)
Zynga’s FarmVille, social games, and the ethics of big data mining
by: Willson, Michele, et al.
Published: (2015)
A proposed learner activity taxonomy and a framework for analysing learner engagement versus performance using big educational data
by: Konstantinidis, Stathis, et al.
Published: (2017)
Multi-source data fusion for land use classification using deep learning
by: Cao, Rui
Published: (2021)
Spark ignition engine combustion process analysis
by: Wiseman, Marc William
Published: (1990)
Big data challenges & opportunities for development using Hadoop 2.0 platform
by: Hegazi, Abdel Rahman Farag
Published: (2014)
Detecting danger in roads: an immune-inspired technique to identify heavy goods vehicles incident hot spots
by: Figueredo, Grazziela P., et al.
Published: (2017)
Sensor networks and personal health data management: software engineering challenges
by: Zhang, Xiang, et al.
Published: (2020)
KEEL 3.0: an open source software for multi-stage analysis in data mining
by: Triguero, Isaac, et al.
Published: (2017)
Using Big Data to manage safety-related risk in the upstream oil & gas industry: a research agenda
by: Tan, Kim Hua, et al.
Published: (2016)
Exploring the impact of road surface conditions on truck fleet fuel consumption through Big Data
by: Perrotta, Federico
Published: (2019)
Comparison of truck fuel consumption measurements with results of existing models and implications for road pavement LCA
by: Perrotta, Federico, et al.
Published: (2018)
Indexing strategies of MapReduce for information retrieval in big data
by: Ramadhan, Mazen Farid Ebrahim
Published: (2016)
Small fish in a big pond: an architectural approach to users privacy, rights and security in the age of big data
by: Angelopoulos, Spyros, et al.
Published: (2016)
Data warehouse structuring methodologies for efficient mining of Western Australian petroleum data sources
by: Nimmagadda, Shastri, et al.
Published: (2005)
Data warehousing and mining technologies for adaptability in turbulent resources business environments
by: Nimmagadda, Shastri, et al.
Published: (2011)
An improvement algorithm for Iris classification by using Linear Support Vector Machine (LSVM), k-Nearest Neighbours (k-NN) and Random Nearest Neighbours (RNN) / Ahmad Haadzal Kamarulzalis and Mohd Asrul Affendi Abdullah
by: Kamarulzalis, Ahmad Haadzal, et al.
Published: (2019)
The next generation fungal diversity researcher
by: Grube, Martin, et al.
Published: (2017)
Detecting and analysing changes in consumer behaviour during life events
by: Darler, William
Published: (2019)