From Big data to Smart Data with the K-Nearest Neighbours algorithm
The k-nearest neighbours algorithm is one of the most widely used data mining models because of its simplicity and accurate results. However, when it comes to deal with big datasets, with potentially noisy and missing information, this technique becomes ineffective and inefficient. Due to its drawba...
| Main Authors: | Triguero, Isaac, Maillo, Jesus, Luengo, Julian, García, Salvador, Herrera, Francisco |
|---|---|
| Format: | Conference or Workshop Item |
| Published: |
2016
|
| Subjects: | |
| Online Access: | https://eprints.nottingham.ac.uk/42475/ |
Similar Items
MRPR: a MapReduce solution for prototype reduction in big data classification
by: Triguero, Isaac, et al.
Published: (2015)
by: Triguero, Isaac, et al.
Published: (2015)
kNN-IS: an iterative spark-based design of the k-nearest neighbors classifier for big data
by: Maillo, Jesus, et al.
Published: (2016)
by: Maillo, Jesus, et al.
Published: (2016)
EPRENNID: An evolutionary prototype reduction based ensemble for nearest neighbor classification of imbalanced data
by: Vluymans, Sarah, et al.
Published: (2016)
by: Vluymans, Sarah, et al.
Published: (2016)
Evolutionary undersampling for extremely imbalanced big data classification under apache spark
by: Triguero, Isaac, et al.
Published: (2016)
by: Triguero, Isaac, et al.
Published: (2016)
KEEL 3.0: an open source software for multi-stage analysis in data mining
by: Triguero, Isaac, et al.
Published: (2017)
by: Triguero, Isaac, et al.
Published: (2017)
Exact fuzzy k-Nearest neighbor classification for big datasets
by: Maillo, Jesus, et al.
Published: (2017)
by: Maillo, Jesus, et al.
Published: (2017)
Local and global measures for measuring performance of big data analytics process
by: Ali, Ismail Mohamed
Published: (2019)
by: Ali, Ismail Mohamed
Published: (2019)
Unstructured big data processing in cloud computing environment by using Amazon Elastic Map Reduce
by: Busu, Norzaharawani
Published: (2017)
by: Busu, Norzaharawani
Published: (2017)
ROSEFW-RF: the winner algorithm for the ECBDL’14 big data competition: an extremely imbalanced big data bioinformatics problem
by: Triguero, Isaac, et al.
Published: (2015)
by: Triguero, Isaac, et al.
Published: (2015)
Knowledge grid to facilitate knowledge sharing model in big data community
by: Hosseinioun, Sara
Published: (2022)
by: Hosseinioun, Sara
Published: (2022)
Factors affecting successful big data analytics implementation in public sector of Malaysia
by: Adrian, Cecilia
Published: (2019)
by: Adrian, Cecilia
Published: (2019)
Factors influencing user intention towards big data technology adoptions in educational organizations
by: Harun, Noor Baizura
Published: (2021)
by: Harun, Noor Baizura
Published: (2021)
Novel Strategies to Accelerate Search Algorithms in Data Reduction
by: LE, HOANG LAM
Published: (2022)
by: LE, HOANG LAM
Published: (2022)
Robust spatial diagnostic method and parameter estimation for spatial big data regression model
by: Ali, Mohammed Baba
Published: (2022)
by: Ali, Mohammed Baba
Published: (2022)
Sensor networks and personal health data management: software engineering challenges
by: Zhang, Xiang, et al.
Published: (2020)
by: Zhang, Xiang, et al.
Published: (2020)
Intention to use big data technology in teaching among higher education educators in Yunnan, China
by: Wang, Qianhui
Published: (2022)
by: Wang, Qianhui
Published: (2022)
An improvement algoithm for Iris classification by using Linear Support Vector Machine (LSVM), k-Nearest Neighbours (k-NN) and Random Nearest Neighbous (RNN) / Ahmad Haadzal Kamarulzalis and Mohd Asrul Affendi Abdullah
by: Kamarulzalis, Ahmad Haadzal, et al.
Published: (2019)
by: Kamarulzalis, Ahmad Haadzal, et al.
Published: (2019)
The Importance of Including the Geoid in Terrestrial Survey Data Reduction to the Geocentric Datum of Australia
by: Featherstone, Will
Published: (1997)
by: Featherstone, Will
Published: (1997)
Verification of the HDM-4 fuel consumption model using a Big data approach: a UK case study
by: Perrotta, Federico, et al.
Published: (2019)
by: Perrotta, Federico, et al.
Published: (2019)
A framework for accelerated product innovation in a big data environment
by: Zhan, Yuanzhu
Published: (2017)
by: Zhan, Yuanzhu
Published: (2017)
Using feature-based product modelling to integrate design and rapid prototyping
by: Campbell, Robert Ian
Published: (1998)
by: Campbell, Robert Ian
Published: (1998)
Using big data to make better decisions in the digital economy
by: Tan, Kim Hua, et al.
Published: (2017)
by: Tan, Kim Hua, et al.
Published: (2017)
Zynga’s FarmVille, social games, and the ethics of big data mining
by: Willson, Michele, et al.
Published: (2015)
by: Willson, Michele, et al.
Published: (2015)
SEG-SSC: a framework based on synthetic examples generation for self-labeled semi-supervised classification
by: Triguero, Isaac, et al.
Published: (2015)
by: Triguero, Isaac, et al.
Published: (2015)
The North West Shelf (NWS), a Digital Petroleum Ecosystem (PDE) in a Big Data Scale
by: Nimmagadda, Shastri, et al.
Published: (2018)
by: Nimmagadda, Shastri, et al.
Published: (2018)
Indexing strategies of MapReduce for information retrieval in big data
by: Ramadhan, Mazen Farid Ebrahim
Published: (2016)
by: Ramadhan, Mazen Farid Ebrahim
Published: (2016)
Big data challenges & opportunities for development using Hadoop 2.0 platform
by: Hegazi, Abdel Rahman Farag
Published: (2014)
by: Hegazi, Abdel Rahman Farag
Published: (2014)
Proportion of nonsteroidal anti-inflammatory drug prescription in equine practice
by: Marco, Duz, et al.
Published: (2018)
by: Marco, Duz, et al.
Published: (2018)
Small fish in a big pond: an architectural approach to users privacy, rights and security in the age of big data
by: Angelopoulos, Spyros, et al.
Published: (2016)
by: Angelopoulos, Spyros, et al.
Published: (2016)
A proposed learner activity taxonomy and a framework for analysing learner engagement versus performance using big educational data
by: Konstantinidis, Stathis, et al.
Published: (2017)
by: Konstantinidis, Stathis, et al.
Published: (2017)
Unlocking the power of big data in new product development
by: Zhan, Yuanzhu, et al.
Published: (2016)
by: Zhan, Yuanzhu, et al.
Published: (2016)
Test process optimisation through big data analysis
by: Kho, Xiang Juan
Published: (2023)
by: Kho, Xiang Juan
Published: (2023)
Big-data Integration Methodologies for effective management and data mining of petroleum digital ecosystems
by: Nimmagadda, Shastri, et al.
Published: (2013)
by: Nimmagadda, Shastri, et al.
Published: (2013)
Exploring the impact of road surface conditions on truck fleet fuel consumption through Big Data
by: Perrotta, Federico
Published: (2019)
by: Perrotta, Federico
Published: (2019)
A novel symbolization technique for time-series outlier detection
by: Smith, Gavin, et al.
Published: (2015)
by: Smith, Gavin, et al.
Published: (2015)
Using Big Data to manage safety-related risk in the upstream oil & gas industry: a research agenda
by: Tan, Kim Hua, et al.
Published: (2016)
by: Tan, Kim Hua, et al.
Published: (2016)
Achieving interoperability in mobility as a service: a data ecosystem leveraging Semantic Web Technologies
by: Essawy, Shams Khaled Elhosseny Ghazy
Published: (2024)
by: Essawy, Shams Khaled Elhosseny Ghazy
Published: (2024)
Predicting online e-marketplace sales performances: a big data approach
by: Li, Boying, et al.
Published: (2016)
by: Li, Boying, et al.
Published: (2016)
Multi-source data fusion for land use classification using deep learning
by: Cao, Rui
Published: (2021)
by: Cao, Rui
Published: (2021)
Data warehouse structuring methodologies for efficient mining of Western Australian petroleum data sources
by: Nimmagadda, Shastri, et al.
Published: (2005)
by: Nimmagadda, Shastri, et al.
Published: (2005)
Similar Items
-
MRPR: a MapReduce solution for prototype reduction in big data classification
by: Triguero, Isaac, et al.
Published: (2015) -
kNN-IS: an iterative spark-based design of the k-nearest neighbors classifier for big data
by: Maillo, Jesus, et al.
Published: (2016) -
EPRENNID: An evolutionary prototype reduction based ensemble for nearest neighbor classification of imbalanced data
by: Vluymans, Sarah, et al.
Published: (2016) -
Evolutionary undersampling for extremely imbalanced big data classification under apache spark
by: Triguero, Isaac, et al.
Published: (2016) -
KEEL 3.0: an open source software for multi-stage analysis in data mining
by: Triguero, Isaac, et al.
Published: (2017)