Vis enkel innførsel

dc.contributor.authorWu, Jimmy Ming-Tai
dc.contributor.authorSrivastava, Gautam
dc.contributor.authorWei, Min
dc.contributor.authorYun, Unil
dc.contributor.authorLin, Jerry Chun-Wei
dc.date.accessioned2021-10-07T13:13:26Z
dc.date.available2021-10-07T13:13:26Z
dc.date.created2021-01-18T14:12:20Z
dc.date.issued2021
dc.identifier.citationWu, J. M.-T., Srivastava, G., Wei, M., Yun, U., & Lin, J. C.-W. (2021). Fuzzy high-utility pattern mining in parallel and distributed Hadoop framework. Information Sciences, 553, 31-48.en_US
dc.identifier.issn0020-0255
dc.identifier.urihttps://hdl.handle.net/11250/2788432
dc.description.abstractOver the past decade, high-utility itemset mining (HUIM) has received widespread attention that can emphasize more critical information than was previously possible using frequent itemset mining (FIM). Unfortunately, HUIM is very similar to FIM since the methodology determines itemsets using a binary model based on a pre-defined minimum utility threshold. Additionally, most previous works only focused on single, small datasets in HUIM, which is not realistic to any real-world scenarios today containing big data environments. In this work, the fuzzy-set theory and a MapReduce framework are both utilized to design a novel high fuzzy utility pattern mining algorithm to resolve the above issues. Fuzzy-set theory is first involved and a new algorithm called efficient high fuzzy utility itemset mining (EFUPM) is designed to discover high fuzzy utility patterns from a single machine. Two upper-bounds are then estimated to allow early pruning of unpromising candidates in the search space. To handle the large-scale of big datasets, a Hadoop-based high fuzzy utility pattern mining (HFUPM) algorithm is then developed to discover high fuzzy utility patterns based on the Hadoop framework. Experimental results clearly show that the proposed algorithms perform strongly to mine the required high fuzzy utility patterns whether in a single machine or a large-scale environment compared to the current state-of-the-art approaches.en_US
dc.language.isoengen_US
dc.publisherElsevieren_US
dc.relation.urihttps://doi.org/10.1016/j.ins.2020.12.004
dc.rightsNavngivelse 4.0 Internasjonal*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/deed.no*
dc.subjectHadoopen_US
dc.subjecthigh fuzzy utility patternen_US
dc.subjecthigh utility itemset miningen_US
dc.subjectbig-dataen_US
dc.subjectfuzzy-set theoryen_US
dc.subjectMapReduceen_US
dc.titleFuzzy high-utility pattern mining in parallel and distributed Hadoop frameworken_US
dc.typePeer revieweden_US
dc.typeJournal articleen_US
dc.description.versionpublishedVersionen_US
dc.rights.holder© 2021 The Author(s)en_US
dc.subject.nsiVDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420en_US
dc.source.pagenumber31-48en_US
dc.source.volume553en_US
dc.source.journalInformation Sciencesen_US
dc.identifier.doi10.1016/j.ins.2020.12.004
dc.identifier.cristin1873324
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel

Navngivelse 4.0 Internasjonal
Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal