Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/10627
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Alsaad, A | - |
dc.contributor.author | Abbod, M | - |
dc.date.accessioned | 2015-04-22T15:13:32Z | - |
dc.date.available | 2014 | - |
dc.date.available | 2015-04-22T15:13:32Z | - |
dc.date.issued | 2014 | - |
dc.identifier.citation | Proceedings - UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, UKSim 2014: 125 - 130, Cambridge, (26-28 March 2014) | en_US |
dc.identifier.isbn | 9781479949236 | - |
dc.identifier.uri | http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7046050 | - |
dc.identifier.uri | http://bura.brunel.ac.uk/handle/2438/10627 | - |
dc.description.abstract | Arabic is a highly inflected language, so effective Arabic text analysis with correct stem and root extraction is challenging. In this paper we present a linguistic root extraction approach composed of two main phases. In the first phase we handle the removal of affixes, including prefixes, suffixes and infixes. Prefixes and suffixes are removed depending on the length of the word, and the morphological pattern of the word is checked after each deduction to remove infixes. In the second phase, the root extraction algorithm is developed further to handle weak, hamzated, eliminated-long-vowel and two-letter geminated words, as there is a reasonably large number of irregular Arabic words in texts. Before roots are extracted, they are checked against a predefined list of 3800 triliteral and 900 quadriliteral roots. A series of experiments has been conducted to improve and test the performance of the proposed algorithm. The results show that root extraction accuracy has improved compared with Khoja's stemming algorithm. | en_US |
dc.format.extent | 125 - 130 | - |
dc.language.iso | en | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
dc.subject | Arabic root extraction | en_US |
dc.subject | Data mining | en_US |
dc.subject | Morphological analyser | en_US |
dc.subject | Natural language processing | en_US |
dc.subject | Text mining | en_US |
dc.title | Arabic text root extraction via morphological analysis and linguistic constraints | en_US |
dc.type | Conference Paper | en_US |
dc.identifier.doi | http://dx.doi.org/10.1109/UKSim.2014.43 | - |
dc.relation.isPartOf | Proceedings - UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, UKSim 2014 | - |
pubs.organisational-data | /Brunel | - |
pubs.organisational-data | /Brunel/Brunel Staff by College/Department/Division | - |
pubs.organisational-data | /Brunel/Brunel Staff by College/Department/Division/College of Engineering, Design and Physical Sciences | - |
pubs.organisational-data | /Brunel/Brunel Staff by College/Department/Division/College of Engineering, Design and Physical Sciences/Dept of Electronic and Computer Engineering | - |
pubs.organisational-data | /Brunel/Brunel Staff by College/Department/Division/College of Engineering, Design and Physical Sciences/Dept of Electronic and Computer Engineering/Electronic and Computer Engineering | - |
pubs.organisational-data | /Brunel/Brunel Staff by Institute/Theme | - |
pubs.organisational-data | /Brunel/Brunel Staff by Institute/Theme/Institute of Energy Futures | - |
pubs.organisational-data | /Brunel/Brunel Staff by Institute/Theme/Institute of Energy Futures/Smart Power Networks | - |
pubs.organisational-data | /Brunel/University Research Centres and Groups | - |
pubs.organisational-data | /Brunel/University Research Centres and Groups/Brunel Business School - URCs and Groups | - |
pubs.organisational-data | /Brunel/University Research Centres and Groups/Brunel Business School - URCs and Groups/Centre for Research into Entrepreneurship, International Business and Innovation in Emerging Markets | - |
pubs.organisational-data | /Brunel/University Research Centres and Groups/School of Health Sciences and Social Care - URCs and Groups | - |
pubs.organisational-data | /Brunel/University Research Centres and Groups/School of Health Sciences and Social Care - URCs and Groups/Brunel Institute for Ageing Studies | - |
pubs.organisational-data | /Brunel/University Research Centres and Groups/School of Health Sciences and Social Care - URCs and Groups/Brunel Institute of Cancer Genetics and Pharmacogenomics | - |
pubs.organisational-data | /Brunel/University Research Centres and Groups/School of Health Sciences and Social Care - URCs and Groups/Centre for Systems and Synthetic Biology | - |
Appears in Collections: | Dept of Electronic and Electrical Engineering Research Papers |
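
The first phase described in the abstract (length-dependent affix removal, a pattern check after each deduction, and validation against a root list) can be illustrated with a minimal sketch. This is not the authors' implementation: the affix lists, patterns, root list, and the function names `match_pattern`, `candidates` and `extract_root` are all hypothetical stand-ins, and the second phase (weak, hamzated, eliminated-long-vowel and geminated words) is not covered.

```python
# Illustrative sketch only: tiny, hypothetical resources, not the paper's lists.
PREFIXES = ["ال", "و", "ب"]          # assumed sample prefixes
SUFFIXES = ["ون", "ات", "ة"]         # assumed sample suffixes
KNOWN_ROOTS = {"كتب", "درس", "علم"}  # stand-in for the predefined root list

# In a pattern, the letters ف ع ل mark the three root slots;
# any other letter is an infix that must appear literally in the word.
ROOT_SLOTS = {"ف", "ع", "ل"}
PATTERNS = ["فاعل", "مفعول", "فعال", "فعل"]


def match_pattern(word, pattern):
    """Return the letters in the root slots if word fits pattern, else None."""
    if len(word) != len(pattern):
        return None
    root = []
    for w, p in zip(word, pattern):
        if p in ROOT_SLOTS:
            root.append(w)      # slot position: take the word's letter
        elif w != p:
            return None         # infix letter does not match
    return "".join(root)


def candidates(word, min_len=3):
    """List the word plus forms with one prefix and/or one suffix removed.

    An affix is only removed if at least min_len letters remain.
    """
    stems = [word]
    for p in PREFIXES:
        if word.startswith(p) and len(word) - len(p) >= min_len:
            stems.append(word[len(p):])
    for stem in list(stems):
        for s in SUFFIXES:
            if stem.endswith(s) and len(stem) - len(s) >= min_len:
                stems.append(stem[:-len(s)])
    return stems


def extract_root(word):
    """Return the first pattern-derived root found in KNOWN_ROOTS, else None."""
    for cand in candidates(word):
        for pattern in PATTERNS:
            root = match_pattern(cand, pattern)
            if root in KNOWN_ROOTS:
                return root
    return None


if __name__ == "__main__":
    for w in ["الكاتب", "مكتوب", "عالمون", "تفاح"]:
        print(w, extract_root(w))
```

Checking each candidate root against `KNOWN_ROOTS` before accepting it mirrors the validation step the abstract mentions; a word whose stripped forms fit no pattern, or whose derived letters are not in the list, yields `None` in this sketch.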
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Fulltext.docx | | 1.7 MB | Unknown | View/Open |
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.