Web Science and the Mind

Sunday, 8 June 2014

Mining Patterns from Linked Data

PETKO VALTCHEV
UQÀM

VIDEO

OVERVIEW: The Web of Data (WoD) can be seen as global database made of multiple datasets. These datasets are published separately — by using new or reusing existing schemas on the Web — yet get interlinked through either direct references between data items or indirect ones, i.e., identity links between items representing the same entity. The technology underlying the WoD, called Linked Data (LD) allows for the construction of a global data graph in which data items are vertices related by edges of different nature. Entities, aka resources, as well as their links, aka properties, are globally identified through URLs. Beside this inherent graph structure, parts of the WoD can behave as a traditional, i.e., relational, database.
After substantial efforts on the standards for publishing and querying of LD on the Web, and lately the interlinking and cleansing of sets of LD, the next big issue is properly extracting new knowledge from the WoD. Data Mining (DM) discipline is about finding chunks of useful knowledge hidden in the data. DM methods are roughly divided into predictive ones, where past experience is analyzed in order to guess what the outcome of an unfolding situation, and descriptive ones whose aim is to provide insights into the regularities in the data without a specific goal. Mining LD is both useful and challenging for many reasons, not the least among them being the rich and complex graph structure induced by a large variety of link types, the availability of domain knowledge expressed as schemas, and even fully-blown ontologies, the heterogeneity in the modelling goals behind individual datasets, etc.
In this talk we discuss the implications of LD for a specific branch of descriptive DM, called pattern mining. We present two different mining methods for that are complementary in many respects. The first one targets usage regularities: It analyses the consumption of resources from the WoD by the users of a specific semantic application and summarizes it as behavioural patterns. The second one mines purely descriptive patterns from a dataset of multiple resource types, which are expressed in a WoD-compliant language and therefore supports ontology design.

READINGS:

    M Rouane-Hacene, M Huchard, A Napoli, P Valtchev, Relational concept analysis: mining concept lattices from multi-relational data Annals of Mathematics and Artificial Intelligence 67 (1), 81-108, 2013
    MH Rouane, M Huchard, A Napoli, P Valtchev, A proposal for combining formal concept analysis and description logics for mining relational data Formal Concept Analysis (vol. of LNCS), 51-65, Springer, 2007
    M Adda, P Valtchev, R Missaoui, C Djeraba, A framework for mining meaningful usage patterns within a semantically enhanced web portal Proc. of the Third C* Conf. on Computer Science and Software,138-147, ACM, 2010
    M Adda, P Valtchev, R Missaoui, C Djeraba, Toward recommendation based on ontology-powered web-usage mining IEEE Internet Computing 11 (4), 45-52, 2007

29 comments:

Unknown14 July 2014 at 09:22
Dans le «Matching Function» qui tente de trouver la classe correspondant à un terme (Ex. Montréal -> Ville), comment fait-on pour faire la correspondance? Est-ce qu'il faut faire cette correspondance manuellement? Ou bien on doit effectuer une requête SPARQL sur LD et ainsi voir que Montréal est une Ville?

Si tel est le cas, qu'est-ce qui se passe si nous avons une ville X ne se trouvant pas dans LD?
ReplyDelete
Replies
Maxwell J. D. Ramstead14 July 2014 at 09:25
My question for Professor Valtchev is whether it is necessary for the search elements to be queried in a specific order for the data mining algorithms to function correctly, or whether the algorithms can identify and match patterns to queries regardless of the actual sequence of queries.
ReplyDelete
Replies
Unknown14 July 2014 at 09:26
Dear Petko, Thanks for your nice presentation! Don't you think recommender systems is the ideal paradigm to follow for hybrid cognitive computing applications with machine learning and ontologies modeling? A few arguments: a recommender system can be an incentive social machine for personally adapted learning. Ontologies provide a clear conceptual logical structure to provide computational model with concepts for cognitive science. Machine Learning provide induction and validation.
ReplyDelete
Replies
Unknown14 July 2014 at 09:31
The ontologies RDF is the only way to the pattern mining? Do you know others? I am interest to know more about your approach. Excelent presentation. Thank you Professor PETKO VALTCHEV.
ReplyDelete
Replies
Eltaani Redha14 July 2014 at 09:43
Thanks Petko, for this wonderful talk, use the class hierarchy and relation could be really the good way to predict pattern evolution and matching.
My question: what we have to do if the sequence is more distant than the general pattern that we seek!!! (We miss three or four components from expected sequence) ?
ReplyDelete
Replies
Louis Chartrand14 July 2014 at 11:00
La FCA a la réputation d'être assez memory-intensive. Est-ce que ça peut être un facteur limitant pour des applications dans de grosses bases de données?
ReplyDelete
Replies
Unknown14 July 2014 at 13:44
You said that we usually use only one type of links between data, but in the example that you presented with links as attractions and hostels is not two different links? Is it possible to structure a system with many types of links? As there is many possible links between data could we consider the ontology project as having the purpose to build them all? Is it possible to realize it if we take into account that links between items are also subjective?
ReplyDelete
Replies
Unknown14 July 2014 at 15:54
To what extent could we make an analogy between the semantic web and symbol grounding in humans? In humans, symbol grounding is the ability to pick out a referent in the real world. With the semantic web, we're giving machines the ability to pick out informational entities on the web (and deduce relations between these entities), by giving these entities semantic meaning which is "understandable" by machines. For example, humans have the ability to pick out, say, a museum in the real world, and conversely, thanks to linked data, a machine will be able to identify a museum on the web of data. The machine still isn't able to identify a museum in the real world, but if we transposed our real world into a world of data, then suddenly the machine is able to identify a museum within this data world.
ReplyDelete
Replies
Fernanda Pérez-Gay Juárez18 July 2014 at 08:59
I found the talk pretty informative; I was finally able to understand clearly the concepts of URIs, data and pattern mining. If we did an analogy between words and uris, is there a way to make uri's flexible in mening equivalence and recognize synonyms (entities that mean exactly the same thing and may be linked to the same data eventhough they are different itemset)? does that exist in data mining or can that be codified?
ReplyDelete
Replies
Petko V18 July 2014 at 20:20
This comment has been removed by the author.
ReplyDelete
Replies

Add comment