WorldCat Identities

Soulé-Dupuy, Chantal

Overview
Works: 29 works in 65 publications in 2 languages and 624 library holdings
Genres: Conference papers and proceedings 
Roles: Editor, Thesis advisor, Publishing director, htt, Other, Opponent, Author
Publication Timeline
.
Most widely held works by Chantal Soulé-Dupuy
Collaborative and social information retrieval and access : techniques for improved user modeling by Max Chevalier( )

13 editions published in 2009 in English and held by 467 WorldCat member libraries worldwide

"This book deals with the improvement of user modeling in the context of Collaborative and Social Information Access and Retrieval (CSIRA) techniques"--Provided by publisher
Advances in information retrieval : 31th [i.e. 31st] European Conference on IR Research, ECIR 2009, Toulouse, France, April 6-9, 2009 ; proceedings by Mohand Boughanem( )

11 editions published in 2009 in English and Undetermined and held by 78 WorldCat member libraries worldwide

This book constitutes the refereed proceedings of the 30th annual European Conference on Information Retrieval Research, ECIR 2009, held in Toulouse, France in April 2009. The 42 revised full papers and 18 revised short papers presented together with the abstracts of 3 invited lectures and 25 poster papers were carefully reviewed and selected from 188 submissions. The papers are organized in topical sections on retrieval model, collaborative IR / filtering, learning, multimedia - metadata, expert search - advertising, evaluation, opinion detection, web IR, representation, clustering / categorization as well as distributed IR
Documents annotés et langages d'indexation [articles issus des 26 et 27ème congrès INFORSID, Fontainebleau, 27-30 mai 2008 et Toulouse, 26-29 mai 2009] by Inforsid (France)( Book )

2 editions published in 2009 in French and held by 15 WorldCat member libraries worldwide

Conception des systèmes d'information : patrons et spécifications formelles( Book )

2 editions published in 2007 in French and held by 14 WorldCat member libraries worldwide

Entreposage de documents et données semi-structurées( Book )

2 editions published in 2007 in French and held by 7 WorldCat member libraries worldwide

Variabilité et adaptation de l'accès à l'information( Book )

2 editions published in 2014 in French and held by 5 WorldCat member libraries worldwide

Vers une adaptation dynamique et contextuelle des systèmes d'information( Book )

2 editions published in 2014 in French and held by 5 WorldCat member libraries worldwide

Vers la reconnaissance des intentions de communication : application au contenu de publications scientifiques by Hassan Kanso( Book )

2 editions published in 2009 in French and held by 3 WorldCat member libraries worldwide

L'intention derrière des actions humaines est un genre de connaissance difficile à appréhender à cause de son ambiguïté. Cette ambiguïté est due au fait que la notion d'intention est utilisée à la fois pour dénoter l'existence d'un but et l'existence d'un plan pour exécuter l'action. Les actions mentales de communication n'échappent pas à cette ambiguïté pour les mêmes raisons. L'hypothèse de base de cette thèse est que les auteurs sont conscients de l'effet qu'ils veulent produire au travers de leur écrit, de fait on ne s'intéressera qu'aux intentions souhaitées volontairement par les auteurs des documents. Pour cela plusieurs hypothèses sont nécessaires pour réduire la difficulté de ces travaux. Par exemple la limitation des types de documents (dans cette thèse les documents scientifiques sont ciblés). Les applications basées sur les services de traitement et de gestion de documents (Aide à la rédaction, Génération de texte, Recherche d'Information, Sélection de documents (Reviewing) etc.). gagneraient si la notion d'intention des auteurs était traitée de manière systématique. Pour cela un modèle des intentions utilisé comme base pour leur reconnaissance devrait contribuer à ce gain. Dans cette thèse, Nous proposons un modèle des intentions, une démarche pour la reconnaissance des intentions des documents scientifiques basée sur ce modèle. La validation de nos propositions se base sur la réalisation d'un outil de reconnaissance des intentions de communication des auteurs dans des documents (RICAD)
Perspectives de méta-analyse pour un environnement d'aide à la simulation et prédiction by William Raynaut( Book )

2 editions published in 2018 in French and held by 2 WorldCat member libraries worldwide

The emergence of the big data phenomenon has led to increasing demands in data analysis, which most often are conducted by other domains experts with little experience in data science. We then consider this important demand in intelligent assistance to data analysis, which receives an increasing attention from the scientific community. The first takes on the subject often possessing similar shortcomings, we propose to address it through new processes of meta-analysis. No evaluation standard having yet been set in this relatively new domain, we first propose a meta-analysis evaluation framework that will allow us to test and compare the developed methods. In order to open new approaches of meta-analysis, we then consider one of its recurring issue: dataset characterization. We then propose and evaluate such a characterization, consisting in a dissimilarity between datasets making use of a precise topological description to compare them. This dissimilarity allows a new meta-analysis approach producing recommendations of complete data analysis processes, which we then evaluate on a proof of concept. We thus detail the proposed methods of meta-analysis, and the associated process of assistance to data analysis
De la modélisation à l'exploitation des documents à structures multiples by Karim Djemal( Book )

2 editions published in 2010 in French and held by 2 WorldCat member libraries worldwide

With the recent development of new information and communication technologies, the paper documents are transformed to digital documents. Furthermore, it considers that the document is no longer seen as a whole, or as a monolithic bloc, but as organized entities. Exploiting these documents amount to identify and locate these entities. These entities are connected by relationships to give a "form" to document. Several types of relationships may occur, so that several "forms" of a document emerge. These different materializations of the same document are related to different uses of the same document and are essential for optimal management and shared of holdings. The work presented in this thesis aims to address the challenges of representing different materializations of a document through its representation of entities and their relationships. If those materializations are translated through structures, the issues are related to the representation of multistructured documents. Our work focuses mainly on the modeling, integration and exploitation of multistructured documents: (1) Proposal of multistructured document model. This model incorporates two levels of description: a specific level to describe each document through entities that compose and a generic level to identify document kinds through the grouping of similar structures. (2) Proposal of techniques for extracting structure (implicit or explicit) of a document (the specific level) and classification of this structure with respect to common structures (the generic level). The classification algorithm proposed includes a calculation of distance called "structural" (comparison of trees and graphs). This classification is associated with a process of verification of the "cohesion" of classes and possible reorganization of disrupted classes. (3) Proposal of document exploitation technical from their structures and their contents: (a) a document search that can reproduce documentary granules through criteria based on research of structures and / or content, (b) a multidimensional analysis that is to analyze and visualize the documentary information across multiple dimensions (of structures and / or content). In order to validate our proposals, we have developed a tool for integration and analysis of multistructured documents, called MDOCREP (Multistructured Document Repository). This tool provides on the one hand, the extraction and classification of document structures, and on the other hand, the querying and the multidimensional analysis of documents from their different structures
Entrepôts de documents : de l'alimentation à l'exploitation by Kaïs Khrouf( Book )

2 editions published in 2004 in French and held by 2 WorldCat member libraries worldwide

Nous proposons dans le cadre de cette thèse le concept d'entrepôt de documents permettant le stockage de documents hétérogènes, sélectionnés et filtrés, ainsi que leur classification selon des structures logiques génériques (structures communes à un ensemble de documents). Une telle organisation des entrepôts permet de faciliter l'exploitation des informations documentaires intégrées au travers de plusieurs techniques complémentaires : la recherche d'information qui consiste à restituer des granules de documents en réponse à une requête formulée à l'aide de mots-clés (langage libre), l'interrogation des données qui consiste à récupérer des données factuelles (de structure ou de contenu) en utilisant un langage déclaratif, l'analyse multidimensionnelle qui consiste à manipuler les informations de l'entrepôt selon des dimensions non prédéfinies.Pour valider nos propositions, nous avons développé un outil DOCWARE (DOCument WAREhouse) d'aide à l'intégration et à l'analyse de documents
Systemes de recherche d'informations. le systeme videotex infodiab, mecanismes d'indexation et d'interrogation by Chantal Soulé-Dupuy( Book )

2 editions published in 1990 in French and held by 2 WorldCat member libraries worldwide

LES TRAVAUX DE RECHERCHE PRESENTES DANS CE MEMOIRE CONSISTENT EN LA REALISATION D'UN SYSTEME DE RECHERCHE D'INFORMATIONS TEXTUELLES, INFODIAB, UTILISANT LE LANGAGE NATUREL COMME SOURCE D'INFORMATION ET COMME MOYEN D'INTERROGATION. CE SYSTEME REPOND A CERTAINES CONTRAINTES LIEES AU PUBLIC VISE (GRAND PUBLIC), A L'OUTIL VIDEOTEX ET AU CONTEXTE MEDICAL DE L'APPLICATION. NOTRE CONTRIBUTION A ALORS PERMIS: LA CONSTRUCTION D'UN MODELE DE REPRESENTATION DES CONNAISSANCES LEXICALES ET SEMANTIQUES AU MOYEN D'UN THESAURUS; LA MISE EN UVRE D'UNE PROCEDURE D'INDEXATION AUTOMATIQUE DES INFORMATIONS TEXTUELLES DE LA BASE PERMETTANT L'ORGANISATION DE MOTS SIMPLES ET COMPOSES, DE SYNTAGMES ET DE RELATIONS SEMANTIQUES ENTRE CES MOTS ET GROUPES DE MOTS (SYNONYMIE, HIERARCHIE); L'ELABORATION D'UNE PROCEDURE D'INTERROGATION SOUPLE ET CONVIVIALE DESTINEE A TOUT UTILISATEUR. CES PROCEDURES REPOSENT ESSENTIELLEMENT SUR DES ANALYSES MORPHOLEXICALES ET STATISTIQUES. AUSSI, APRES AVOIR RESITUE LES SYSTEMES DE RECHERCHE D'INFORMATIONS DANS LES DIFFERENTS CONTEXTES AMENANT A LEUR DEVELOPPEMENT, NOUS INTRODUISONS LES PRINCIPAUX CONCEPTS PROPRES AU DOMAINE DE LA RECHERCHE D'INFORMATION. NOUS PRESENTONS ENFIN LES MECANISMES D'INDEXATION ET D'INTERROGATION MIS EN UVRE DANS INFODIAB AINSI QUE LES EXTENSIONS EN COURS D'ETUDE EN VUE D'EVENTUELLES OPTIMISATIONS DU PROCESSUS DE RECHERCHE
Gestion de l'hétérogénéité documentaire : le cas d'un entrepôt de documents multimédia by Mohamed Mbarki( Book )

2 editions published in 2008 in French and held by 2 WorldCat member libraries worldwide

The knowledge society is based on three axes: the diffusion and use of information via new technologies, the deduction of knowledge induced by this information and the economic impacts which can result from this information. To offer to the actors and more particularly to the "decision makers" of this society some tools which enable them to produce and manage "knowledge" or at least "elements of knowledge" seem to be rather difficult to ensure. This difficulty is due to the dynamism of the environment and the diversity of factors influencing the information production, extraction and communication. Indeed, this information is included in documents which are collected from disseminated sources (Internet, Workflow, numerical libraries, etc.). These documents are thus heterogeneous on the content and on the form (they can be related to various fields, they can be more or less structured, they can have various structures, they contain several type of media, are stored in several type of supports, etc). The current challenges are to conceive new applications to exploit this document heterogeneity. Having in mind these needs, the work presented in my thesis, aims to face these challenges and in particular at proposing solutions in order "to manage and create knowledge" starting from the integration of all information available on the heterogeneous documents. The handling of multimedia documents repositories constitutes the applicative framework of our proposals. Our approach is articulated around three complementary axes: (1) the representation, (2) storage (or integration) and (3) exploitation of the heterogeneous documents. Documents representation is related to the determination of information that must be preserved and the way according to which they must be organized to offer better apprehending and envisaging of their uses. The solution that we chose to meet these needs bases on the proposal for a documents model which integrates several overlapping and complementary levels of description (a generic layer and a specific one, a logical description and a semantic one)
Modélisation et exploitation de profils : accès sémantique à des ressources by Pascaline Laure Tchienehom( Book )

2 editions published in 2006 in French and held by 2 WorldCat member libraries worldwide

L'accès à des ressources est une vision plus large de l'accès à l'information où les ressources peuvent être étendues à toutes sortes de catégories de personnes, choses ou actions. L'hétérogénéité de ces ressources a conduit au développement de nombreuses méthodes d'accès. Ces méthodes sont basées sur la description des ressources utilisées, que l'on appelle profil, et sur la définition de principes d'exploitation de ces descriptions pour la réalisation d'une tâche spécifique (recherche, filtrage, etc.). Les modèles de profils et principes d'exploitation de ces derniers diffèrent d'une application à une autre. Afin de faire collaborer différentes applications, il y a un réel besoin de définir un cadre homogène et flexible de modélisation et d'exploitation de profils. Nos travaux visent à proposer des solutions sur ces deux aspects, au travers d'un modèle générique de profil ainsi que de méthodes d'analyse sémantique et d'appariement d'instances de ce modèle. Pour valider nos propositions, un outil d'aide à la construction, à la visualisation et à l'analyse sémantique de profils a été implémenté. De plus, une évaluation des méthodes d'analyse sémantique et d'appariement de profils proposées a été effectuée
Intégration automatisée de l'expertise du patient dans le suivi à distance de sa pathologie chronique by Amira Derradji( )

1 edition published in 2017 in French and held by 2 WorldCat member libraries worldwide

For several years, the deployment of information and communication technologyintothemanagementofchronicalpathologiesistakingaconsiderableplace, more particularly in the evolution of health's practices and in the improvement of the well-being of the patient Chronical pathologies are of long duration and they need to be under a regular monitoring of the healthcare professional, composed of multidisciplinary or different actors in charge with the patients. On the other side the patients are alsochargedoffollowingahealthcareprotocolathomepreviouslydefinedbythe health care team. Nevertheless, the different forms of representing the contests of this protocol, it is not always complete and comprehensible for the patients. Furthermore, each one of the patients is unique and a proper definition of the health care protocol must be personalised and conform to his individual treatment and even to his personal wishes or constraints. But this is not the case of information guides or medical references that are supplied in general. With the intent to improve the interaction between the patient and the healthcareprofessionalsrelatedtothehealthcareprotocol,wepropose(i)alanguagefor the computerised representation of the healthcare protocol, sibling the healthcare professionals and the patients, enough simple, intuitive and easy to understand, (ii) an ontology for the patient expertise (based on his experience on the disease) allowingsotheinteractionofthepatientwithhishealthcareprotocolbyreporting all the unexpected behaviours. These behaviours are events that are not defined in the initial health care protocol
Détection de problèmes de qualité dans les ontologies construites automatiquement à partir de textes by Toader Gherasim( Book )

2 editions published in 2013 in French and held by 2 WorldCat member libraries worldwide

The growing use of ontologies in a variety of application areas has stimulated the development of approaches proposing different degrees of automation of the ontology construction process. However, despite the real interest of these approaches, sometimes their results are of low quality. The aim of the work presented in this thesis is to contribute to the improvement of the quality of ontologies constructed automatically from texts. Our main contributions are : (1) a method for the comparison of the approaches, (2) a typology of problems that affect the quality of ontologies, and (3) a first reflection on automating the detection of quality problems. Our method for the comparison of approaches consists of three complementary steps : (1) on the basis of their degree of automation and completeness, (2) on the basis of their technical and functional characteristics, and (3) experimentally by comparing their results with a manually constructed ontology. The proposed typology organizes the quality problems according to two dimensions : errors versus unsuitable situations and logical aspects versus social aspects. Our typology contains 24 classes of problems that cover and complement the problems described in the literature. Concerning the automatic detection we have inventoried some of the existing methods for each problem in our typology and we have highlighted the problems for which the automatic detection remains an open issue. We have also proposed a heuristic for the detection of a quality problem that appears frequently in our experimentations (polysemic labels)
Credit card fraud detection using machine learning with integration of contextual knowledge by Yvan Lucas( )

2 editions published in 2019 in English and held by 2 WorldCat member libraries worldwide

The detection of credit card fraud has several features that make it a difficult task. First, attributes describing a transaction ignore sequential information. Secondly, purchasing behavior and fraud strategies can change over time, gradually making a decision function learned by an irrelevant classifier. We performed an exploratory analysis to quantify the day-by-day shift dataset and identified calendar periods that have different properties within the dataset. The main strategy for integrating sequential information is to create a set of attributes that are descriptive statistics obtained by aggregating cardholder transaction sequences. We used this method as a reference method for detecting credit card fraud. We have proposed a strategy for creating attributes based on Hidden Markov Models (HMMs) characterizing the transaction from different viewpoints in order to integrate a broad spectrum of sequential information within transactions. In fact, we model the authentic and fraudulent behaviors of merchants and cardholders according to two univariate characteristics: the date and the amount of transactions. Our multi-perspective approach based on HMM allows automated preprocessing of data to model temporal correlations. Experiments conducted on a large set of data from real-world credit card transactions (46 million transactions carried out by Belgian cardholders between March and May 2015) have shown that the proposed strategy for pre-processing data based on HMMs can detect more fraudulent transactions when combined with the Aggregate Data Pre-Processing strategy
Structuration sématique de documents XML centres-documents by Salma Ben Meftah( )

1 edition published in 2017 in French and held by 1 WorldCat member library worldwide

Le résumé en anglais n'a pas été communiqué par l'auteur
An Efficient Classification Model for Analyzing Skewed Data to Detect Frauds in the Financial Sector by Sara Makki( )

1 edition published in 2019 in English and held by 1 WorldCat member library worldwide

There are different types of risks in financial domain such as, terrorist financing, money laundering, credit card fraudulence and insurance fraudulence that may result in catastrophic consequences for entities such as banks or insurance companies. These financial risks are usually detected using classification algorithms. In classification problems, the skewed distribution of classes also known as class imbalance, is a very common challenge in financial fraud detection, where special data mining approaches are used along with the traditional classification algorithms to tackle this issue. Imbalance class problem occurs when one of the classes have more instances than another class. This problem is more vulnerable when we consider big data context. The datasets that are used to build and train the models contain an extremely small portion of minority group also known as positives in comparison to the majority class known as negatives. In most of the cases, it's more delicate and crucial to correctly classify the minority group rather than the other group, like fraud detection, disease diagnosis, etc. In these examples, the fraud and the disease are the minority groups and it's more delicate to detect a fraud record because of its dangerous consequences, than a normal one. These class data proportions make it very difficult to the machine learning classifier to learn the characteristics and patterns of the minority group. These classifiers will be biased towards the majority group because of their many examples in the dataset and will learn to classify them much faster than the other group. After conducting a thorough study to investigate the challenges faced in the class imbalance cases, we found that we still can't reach an acceptable sensitivity (i.e. good classification of minority group) without a significant decrease of accuracy. This leads to another challenge which is the choice of performance measures used to evaluate models. In these cases, this choice is not straightforward, the accuracy or sensitivity alone are misleading. We use other measures like precision-recall curve or F1 - score to evaluate this trade-off between accuracy and sensitivity. Our objective is to build an imbalanced classification model that considers the extreme class imbalance and the false alarms, in a big data framework. We developed two approaches: A Cost-Sensitive Cosine Similarity K-Nearest Neighbor (CoSKNN) as a single classifier, and a K-modes Imbalance Classification Hybrid Approach (K-MICHA) as an ensemble learning methodology. In CoSKNN, our aim was to tackle the imbalance problem by using cosine similarity as a distance metric and by introducing a cost sensitive score for the classification using the KNN algorithm. We conducted a comparative validation experiment where we prove the effectiveness of CoSKNN in terms of accuracy and fraud detection. On the other hand, the aim of K-MICHA is to cluster similar data points in terms of the classifiers outputs. Then, calculating the fraud probabilities in the obtained clusters in order to use them for detecting frauds of new transactions. This approach can be used to the detection of any type of financial fraud, where labelled data are available. At the end, we applied K-MICHA to a credit card, mobile payment and auto insurance fraud data sets. In all three case studies, we compare K-MICHA with stacking using voting, weighted voting, logistic regression and CART. We also compared with Adaboost and random forest. We prove the efficiency of K-MICHA based on these experiments
Variabilité et adaptation de l'accès à l'information( Book )

1 edition published in 2014 in French and held by 1 WorldCat member library worldwide

 
moreShow More Titles
fewerShow Fewer Titles
Audience Level
0
Audience Level
1
  Kids General Special  
Audience level: 0.25 (from 0.11 for Vers une a ... to 0.96 for Intégrati ...)

Collaborative and social information retrieval and access : techniques for improved user modeling
Covers
Advances in information retrieval : 31th [i.e. 31st] European Conference on IR Research, ECIR 2009, Toulouse, France, April 6-9, 2009 ; proceedings
Alternative Names
Dupuy, Chantal Soulé-

Languages
French (29)

English (26)