Research and Development in Intelligent Systems XXVI - Incorporating Applications and Innovations in Intelligent Systems XVII

Research and Development in Intelligent Systems XXVI - Incorporating Applications and Innovations in Intelligent Systems XVII

von: Richard Ellis, Miltos Petridis

Springer-Verlag, 2009

ISBN: 9781848829831 , 504 Seiten

Format: PDF

Kopierschutz: Wasserzeichen

Windows PC,Mac OSX geeignet für alle DRM-fähigen eReader Apple iPad, Android Tablet PC's

Preis: 213,99 EUR

  • AutoCAD 2012 - Von der 2D-Linie zum 3D-Modell
    Organisiert (DIGITAL lifeguide) - Termine, Kontakte, Aufgaben immer & überall im Griff
    iTunes (DIGITAL lifeguide) - Die besten Tipps und Tricks für entspannten Musikgenuss
    Von PDM zu PLM - Prozessoptimierung durch Integration
    Konstruieren mit CAD - Das Komplettpaket für 3D Modellieren im Maschinenbau

     

     

     

     

 

Mehr zum Inhalt

Research and Development in Intelligent Systems XXVI - Incorporating Applications and Innovations in Intelligent Systems XVII


 

The most common document formalisation for text classi?cation is the vector space model founded on the bag of words/phrases representation. The main advantage of the vector space model is that it can readily be employed by classi?cation - gorithms. However, the bag of words/phrases representation is suited to capturing only word/phrase frequency; structural and semantic information is ignored. It has been established that structural information plays an important role in classi?cation accuracy [14]. An alternative to the bag of words/phrases representation is a graph based rep- sentation, which intuitively possesses much more expressive power. However, this representation introduces an additional level of complexity in that the calculation of the similarity between two graphs is signi?cantly more computationally expensive than between two vectors (see for example [16]). Some work (see for example [12]) has been done on hybrid representations to capture both structural elements (- ing the graph model) and signi?cant features using the vector model. However the computational resources required to process this hybrid model are still extensive.