Content area
Full Text
http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = Artif Intell Rev (2016) 46:113128
DOI 10.1007/s10462-016-9458-x
Mohammad Arshi Saloot1 Norisma Idris1 Rohana Mahmud1
Salinah Jaafar1 Dirk Thorleuchter2 Abdullah Gani1
Published online: 8 January 2016 Springer Science+Business Media Dordrecht 2016
Abstract Hadiths are important textual sources of law, tradition, and teaching in the Islamic world. Analyzing the unique linguistic features of Hadiths (e.g. ancient Arabic language and story-like text) results to compile and utilize specic natural language processing methods. In the literature, no study is solely focused on Hadith from articial intelligence perspective, while many new developments have been overlooked and need to be highlighted. Therefore, this review analyze all academic journal and conference publications that using two main methods of articial intelligence for Hadith text: Hadith classication and mining. All Hadith relevant methods and algorithms from the literature are discussed and analyzed in terms of functionality, simplicity, F-score and accuracy. Using various different Hadith datasets makes a direct comparison between the evaluation results impossible. Therefore, we have re-implemented and evaluated the methods using a single dataset (i.e. 3150 Hadiths from Sahih Al-Bukhari book). The result of evaluation on the classication method reveals that neural networks classify the Hadith with 94% accuracy. This is because neural networks are capable of handling complex (high dimensional) input data. The Hadith mining method that combines vector space model, Cosine similarity, and enriched queries obtains the best accuracy result
B Mohammad Arshi Saloot
Norisma Idris [email protected]
Rohana Mahmud [email protected]
Salinah Jaafar [email protected]
Dirk Thorleuchter [email protected]
Abdullah [email protected] University of Malaya, 50603 Kuala Lumpur, Malaysia2 Institute of Fraunhofer INT, Appelsgarten 2, 53879 Euskirchen, Germany
http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = http://crossmark.crossref.org/dialog/?doi=10.1007/s10462-016-9458-x&domain=pdf
Web End = Hadith data mining and classication: a comparative analysis
123
114 M. A. Saloot et al.
(i.e. 88%) among other re-evaluated Hadith mining methods. The most important aspect in Hadith mining methods is query expansion since the query must be tted to the Hadith lingo. The lack of knowledge based methods is evident in Hadith classication and mining approaches and this absence can be covered in future works using knowledge graphs.
Keywords Review Comparison Islamic knowledge Hadith Classication Data
mining