BDA'19
35ème Conférence sur la Gestion de Données – Principes, Technologies et Applications.
35ème Conférence sur la Gestion de Données – Principes, Technologies et Applications.
Les 35èmes journées de la conférence BDA « Gestion de Données – Principes, Technologies et Applications », rendez-vous incontournable de la communauté gestion de données en France, auront lieu cette année à Lyon, du 15 au 18 octobre 2019, sur le Campus Lyon Tech à la Doua (Laboratoire LIRIS, INSA de Lyon & Université Lyon 1, Villeurbanne).
La recherche en gestion de données n’a jamais été aussi active, variée, ouverte sur d’autres champs de l’informatique et, au-delà, sur les grands défis des applications modernes. Poursuivant la tradition de rencontres annuelles de la communauté de gestion de données francophone, BDA 2019 invite académiques et industriels à soumettre leurs travaux récents pour rendre compte des défis et des avancées scientifiques et industrielles dans ce domaine en pleine effervescence.
BDA 2019 invite des participants académiques et industriels à soumettre leurs travaux récents pour rendre compte des défis et des avancées scientifiques et industrielles dans ce domaine extrêmement dynamique. Par ailleurs, elle encourage les soumissions aux interfaces de la gestion de données. Plus généralement, nous sollicitons des contributions originales ciblant les sujets suivants (liste non exhaustive) : Humanités numériques (histoire, musicologie, culture et patrimoine, gestion, droit, etc.), bio-informatique et santé, TAL, ville intelligente, Smart building, green IT, ...
Les soumissions attendues peuvent aussi bien développer des approches théoriques que pratiques : modélisations, techniques, architectures techniques, plateformes, expérimentations ou études empiriques.
Plusieurs catégories de contributions sont attendues :
Les articles pourront être soumis en français ou en anglais, sous forme de documents PDF en format ACM sigconf. Le site de soumission pour toutes les catégories de contributions est :
Pour chaque catégorie, deux types de soumission sont acceptés :
Lors de la soumission, les auteurs sont invités à choisir le type de soumission (P ou NP). Les papiers sont évalués selon les mêmes critères, dans des flux différents. La proportion des papiers P sera limitée (fraction à la discrétion du comité de programme).
La publication des actes se fait en deux temps :
Comme chaque année, les meilleures publications longues NP pourront être invitées pour un numéro spécial de revue. Par ailleurs, les meilleurs articles de recherche longs seront invités à un numéro spécial de la revue internationale Transactions on Large-Scale Data and Knowledge-Centered Systems (TLDKS ). Pour les articles P, l’article invité est une extension de la version déjà publiée qui doit comporter 30% de valeur ajoutée.
Cette année, le comité décernera un prix du meilleur article de recherche, en plus du prix habituel de la meilleure démonstration.
Pour la première fois en 2019, la communauté BDA distingue par un prix (et éventuellement un accessit) une thèse réalisée dans le domaine de la gestion de données (au sens large). Sont éligibles les doctorant(e)s:
Le prix distinguera les contributions apportées pendant la thèse, que ce soit au niveau formel, théorique, d'architecture, et/ou de développement d'algorithmes, prototypes ou systèmes. Sont éligibles toutes les thèses soutenues dans les conditions ci-dessus, dans les domaines scientifiques concernés par les appels à communications récents des conférences BDA (2019, 2018 , 2017 , 2016 ).
Les candidat(e)s devront soumettre au plus tard le 15/07/2019 un seul fichier PDF contenant :
La soumission se fera par EasyChair à l'adresse :
Il est recommandé que les publications et autres réalisations associées à la thèse soient consultables en ligne lors de la soumission.
Les candidats n'ayant pas reçu l'ensemble des documents à temps pour la soumission, doivent les fournir au jury dès que possible et avant le 14/10/2019.
Les propositions seront évaluées par un comité d'experts dans les domaines scientifiques concernés par le prix. Les candidats doivent avoir fourni au jury l'ensemble des documents demandés avant le 14/10/2019.
Le comité proposera un prix et éventuellement un accessit. Le ou les jeunes docteurs concernés se verront décerner le prix lors de la conférence BDA 2019 à Lyon; ils ou elles pourront bénéficier d'une inscription gratuite à la conférence.
Prix du Meilleur Article :
Ousmane Issa, Angela Bonifati et Farouk Toumani
pour l'article « A Relational Framework for Inconsistency-aware Query Answering »
Prix du Meilleur Article :
Thomas Minier, Hala Skaf-Molli et Pascal Molli
pour l'article « SaGe: Web Preemption for Public SPARQL Query Services »
Prix de la Meilleure Démonstration :
Yanlei Diao, Pawel Guzewicz, Ioana Manolescu et Mirjana Mazuran
pour la démonstration « Spade: a Modular Framework for Analytical Exploration of RDF Graphs »
Prix de Thèse :
Michele Linardi
pour sa thèse intitulée « Variable-length Similarity Search for Very Large Data Series: Subsequence matching, Motif and Discord Detection »
[DBLP ]
Prix de Thèse :
Mikaël Monet
pour sa thèse intitulée « Combined Complexity of Probabilistic Query Evaluation »
[HAL ]
Accessit au Prix de Thèse :
Ugo Comignani
pour sa thèse intitulée « Interactive Mapping Specification and Repairing in the Presence of Policy Views »
[HAL ]
Boualem Benatallah, University of New South Wales (UNSW, Sydney, Australia)
CONFÉRENCE ANNULÉE
Cognitive services and their instantiation in the form of messaging or chat bots, task-oriented conversational bots, software robots, digital or virtual assistants, are today used by millions of users. Increasingly, organisations have started to use conversational AI to augment and improve productivity and effectiveness of their customers, workers and stake-holders, automate business processes, deliver data-driven insights. However, there are significant gaps in the cognitive service-enabled endeavour. From engineering perspective, bot developer defines goals for the bot, selects or creates relevant user intents and API(s) providing the service, trains the bot extensively to identify the intents and parameters, and manually translates intents into API calls and conversations with users. We postulate that the ubiquity of cognitive services will have little value if they cannot easily integrate and reuse concomitant capabilities across large number of evolving and heterogeneous devices, data sources and applications. At the same time, APIs are unlocking application, data source and device silos through standardised interaction protocols and access interfaces. Today, within the Web and Mobile development community, complex applications are being stringed together with a few lines of code – all made possible by APIs and their composition.
To leverage the opportunities that APIs bring, we need bot development to 'scale' in terms of how efficiently and effectively they can integrate with potentially large number of evolving APIs. We will discuss some critical challenges to achieve this objective. First, a core challenge is the lack of latent and rich intent and APIs knowledge to effectively and efficiently support dynamic mapping of complex and context-specific user intents to API calls. Second, user intent may be complex and its realisation requires composition of multiple APIs (e.g., triggering multiple APIs to control IoT devices using one user utterance). Existing intent composition techniques typically rely on inflexible and costly methods including extensive intent training or development of complex and hard-coded intent recognition rules. We will discuss challenges in API aware training of cognitive services. We will discuss novel latent knowledge-powered middleware techniques and services to accelerate bot development pipelines by: (i) devising novel intent and API element embeddings and matching techniques, (ii) declaratively specifying reusable and configurable conversation models to support complex user intent provisioning; and (iii) dynamically synthesising API calls instead of ad hoc, rule-based and costly development of intent-to-executable-code mappings.
Prof. Boualem Benatallah is a scientia professor and research group leader at the School of Computer Science (CSE), University of New South Wales (UNSW, Sydney, Australia). His main research interests are developing fundamental concepts and techniques in service composition, cloud services orchestration, quality control in crowdsourcing services, cognitive services and case management, hybrid computation for software vulnerability discovery, legal investigations and systematic literature reviews. He has published more than 270 publications (including 240 refereed papers). A large number of his papers appeared in very selective and reputable conferences and journals. He has been frequently invited to give keynote talks, lectures and tutorials in international conferences and summer schools. His work on service composition, in collaboration with colleagues and students, made pioneering contributions, notably on integrating mathematical optimization techniques and service composition along with declarative composition. Boualem has been PC co-chair of number of international conferences including BPM'2005, ICSOC'2005, WISE'2007, ICWE'2010, IEEE/ACM WI'2011, IEEE SOCA'2011. He was research track co-chair for the WWW'2011 conference. He was the general chair of ICSOC'2008 and BPM’2018. He acted as a key official (tutorial chair, workshops chair, publication chair, area chair, PhD symposium chair) for several international conferences. He has been guest editor of number of special issues for reputable international journals including ACM TOIT and Parallel and Distributed Databases. He has been a PC member of all the reputable international conferences in his areas of research including VLDB, ICDE, WWW, EDBT, MDM, ICSOC, ICWS and ER. He is member of the steering committee of BPM and ICSOC conferences. He is member of the editorial board of numerous international journals and series including ACM Transactions on Web, IEEE transactions on Cloud Computing, Parallel and Distributed Databases, and Computing. He was a visiting Professor at INRIA-LORIA, CNRS, Claude Bernard University (France), University of Blaise Pascal (Clermont Ferrand, France), University of Trento (Italy), University of Paris Dauphine (France). As the chair of the CSE research committee, he was member of the team (comprising multiple university, government and industry partners) that constructed the successful bid for the Smart Services CRC (Cooperative Research Centre). He is a leader of the data curation research stream at the Data to Decisions CRC. He was the leader of major collaborative and cross-institutional projects at the Smart Services and D2D CRCs, involving academic, government and industry partners. He is member of Executive Committee of IEEE Computer Society's Technical Committee on Business Informatics and Systems.
Leonid Libkin , University of Edinburgh (Scotland, United Kingdom)
This talk grew out of our work on handling nulls in SQL - and in the process our attempt to understand the semantics of SQL. It led to quite a few surprises, reviewed here mainly as a collection of anecdotes, of several kinds:
I shall also talk about what we do to address these issues, for example, providing a formal semantics of actual SQL, without any shortcuts, and understanding the necessity of SQL design choices, such as the much criticized 3-valued logic.
Prof. Leonid Libkin is Professor of Foundations of Data Management in the School of Informatics at the University of Edinburgh. He was previously a Professor at the University of Toronto and a member of research staff at Bell Laboratories in Murray Hill. He received his PhD from the University of Pennsylvania in 1994. His main research interests are in the areas of data management and applications of logic in computer science. He has written five books and over 200 technical papers. His awards include a Marie Curie Chair Award, a Royal Society Wolfson Research Merit Award, and six Best Paper Awards. He has chaired programme committees of major database conferences (ACM PODS, ICDT) and was the conference chair of the 2010 Federated Logic Conference. He has given many invited conference talks and has served on multiple program committees and editorial boards. He is an ACM fellow, a fellow of the Royal Society of Edinburgh, and a member of Academia Europaea.
Amr El Abbadi , University of California (Santa Barbara, USA)
(in collaboration with: Divy Agrawal, Mohammad Amiri, Sujaya Maiyya, Victor Zakhary)
Once upon a time databases were structured, one size fit all and they resided on machines that were trustworthy and even when they failed, they simply crashed. This era has come and gone as eloquently stated by Mike Stonebraker. We now have key-value stores, graph databases, text databases, and a myriad of unstructured data repositories. However, we, as a database community still cling to our 20th century belief that databases always reside on trustworthy, honest servers. This notion has been challenged and abandoned by many other Computer Science communities, most notably the security and the distributed systems communities.
The rise of the cloud computing paradigm as well as the rapid popularity of blockchains demand a rethinking of our naïve, comfortable beliefs in an ideal benign infrastructure. In the cloud, clients store their sensitive data in remote servers owned and operated by cloud providers. The Security and Crypto Communities have made significant inroads to protect both data and access privacy from malicious untrusted storage providers using encryption and oblivious data stores. The Distributed Systems and the Systems Communities have developed consensus protocols to ensure the fault-tolerant maintenance of data residing on untrusted, malicious infrastructure. However, these solutions face significant scalability and performance challenges when incorporated in large scale data repositories. Novel database design needs to directly address the natural tension between performance, fault-tolerance and trustworthiness. This is a perfect setting for the database community to lead and guide.
In this talk, I will discuss the state of the art in terms of data management in malicious, untrusted settings, its limitations and potential approaches to mitigate these shortcomings. As examples, I will use cloud and distributed databases that reside on untrustworthy malicious infrastructure and discuss specific approachs for standard database problems like commitment and replication. I will also explore blockchains, which can be viewed as asset management databases in untrusted infrastructures. In this context, I will discuss recent approaches to improve transaction throughput in blockchains, as well as recent solutions to achieve atomic commitment among multiple not trusting blockchains.
Prof. Amr El Abbadi is a Professor of Computer Science at the University of California, Santa Barbara. He received his B. Eng. from Alexandria University, Egypt, and his Ph.D. from Cornell University. His research interests are in the fields of fault-tolerant distributed systems and databases, focusing recently on Cloud data management and blockchain based systems. Prof. El Abbadi is an ACM Fellow, AAAS Fellow, and IEEE Fellow. He was Chair of the Computer Science Department at UCSB from 2007 to 2011. He has served as a journal editor for several database journals, including, The VLDB Journal, IEEE Transactions on Computers and The Computer Journal. He has been Program Chair for multiple database and distributed systems conferences. He currently serves on the executive committee of the IEEE Technical Committee on Data Engineering (TCDE) and was a board member of the VLDB Endowment from 2002 to 2008. In 2007, Prof. El Abbadi received the UCSB Senate Outstanding Mentorship Award for his excellence in mentoring graduate students. In 2013, his student, Sudipto Das received the SIGMOD Jim Gray Doctoral Dissertation Award. Prof. El Abbadi is also a co-recipient of the Test of Time Award at EDBT/ICDT 2015. He has published over 300 articles in databases and distributed systems and has supervised over 35 PhD students.
Behrooz Omidvar-Tehrani , University of Grenoble Alpes (Grenoble, France)
Sihem Amer-Yahia , University of Grenoble Alpes & CNRS (Grenoble, France)
Diaporama du tutoriel
Mardi 15 octobre de 9h00 à 10h30. Calendrier des Journées
User data is becoming increasingly available in various domains ranging from the social Web to electronic patient health records (EHRs). User data is characterized by a combination of demographics (e.g., age, gender, life status) and user actions (e.g., posting a tweet, following a diet). Domain experts rely on user data to conduct large-scale population studies. Information consumers, on the other hand, rely on user data for routine tasks such as finding a book club and getting advice from look-alike patients. User data analytics is usually based on identifying group-level behaviors such as “teenage females who watch Titanic” and “old male patients in Paris who suffer from Bronchitis.” In this tutorial, we review data pipelines for User Group Analytics (UGA). These pipelines admit raw user data as input and return insights in the form of user groups. We review research on UGA pipelines and discuss approaches and open challenges for discovering, exploring, and visualizing user groups. Throughout the tutorial, we will illustrate examples in two key domains: “the social Web” and “health-care”.
Mohamed-Amine Baazizi , LIP6, Sorbonne Université (Paris, France)
Dario Colazzo , Université Paris-Dauphine (Paris, France)
Giorgio Ghelli , Università di Pisa (Pisa, Italia)
Carlo Sartiani , Università della Basilicata (Potenza, Italia)
Documents du tutoriel
Mardi 15 octobre de 10h30 à 12h30. Calendrier des Journées
The last few years have seen the fast and ubiquitous diffusion of JSON as one of the most widely used formats for publishing and interchanging data, as it combines the flexibility of semistructured data models with well-known data structures like records and arrays. The user willing to effectively manage JSON data collections can rely on several schema languages, like JSON Schema, JSound, and Joi, as well as on the type abstractions offered by modern programming and scripting languages like Swift or TypeScript. The main aim of this tutorial is to provide the audience (both researchers and practitioners) with the basic notions for enjoying all the benefits that schema and types can offer while processing and manipulating JSON data. This tutorial focuses on four main aspects of the relation between JSON and schemas: (1) we survey existing schema language proposals and discuss their prominent features; (2) we analyze tools that can infer schemas from data, or that exploit schema information for improving data parsing and management; and (3) we discuss some open research challenges and opportunities related to JSON data.
Amr El Abbadi , University of California (Santa Barbara, USA)
Diaporama du tutoriel (cf. programme)
Mardi 15 octobre de 10h30 à 12h30. Calendrier des Journées
The uprise of Bitcoin and other peer-to-peer cryptocurrencies has opened many interesting and challenging problems in cryptography, distributed systems, and databases. The main underlying data structure is blockchain, a scalable fully replicated structure that is shared among all participants and guarantees a consistent view of all user transactions by all participants in the system. In this tutorial, we discuss the basic protocols used in blockchain, and elaborate on its main advantages and limitations. To overcome these limitations, we provide the necessary distributed systems background in managing large scale fully replicated ledgers, using Byzantine Agreement protocols to solve the consensus problem. Finally, we expound on some of the most recent proposals to design scalable and efficient blockchains in both permissionless and permissioned settings. The focus of the tutorial is on the distributed systems and database aspects of the recent innovations in blockchains.
Les inscriptions sont ouvertes : Site d'inscription
avant le |
après le |
|
---|---|---|
Étudiant | 250.00€ | 350.00€ |
Autre | 350.00€ | 450.00€ |
Autre = académique, industriel, etc.
Note 1 : lors de l'inscription il est aussi possible d'acheter des tickets supplémentaires pour le dîner de gala du jeudi soir au prix de 90.00€.
Note 2 : toujours lors de l'inscription, pensez à choisir le tutoriel auquel vous prévoyez d'assister trois tutoriels sont proposés en parallèle mardi matin).
ATTENTION : Pour le bon fonctionnement du site d'inscription, pensez à autoriser les fenêtres pop-up pour que la page de paiement puisse s’ouvrir.
Pour les entreprises ou structures souhaitant être sponsor ou apporter leur soutien à l'évènement, merci de nous contacter.
Les inscriptions sont ouvertes : Site d'inscription
Le Campus de LyonTech - la Doua regroupe un ensemble d'établissements lyonnais de l'enseignement supérieur en sciences et technologies, en particulier l'Université Claude Bernard Lyon 1 et l'INSA de Lyon, au nord de Villeurbanne. La conférence BDA'19 a lieu au cœur du campus dans le bâtiment de la Rotonde et la bibliothèque Marie Curie.
Le campus est facile d'accès en transport en commun, depuis l'aéroport Lyon Saint-Exupéry (via le Rhône Express) et depuis la gare Lyon Part-Dieu (ou Lyon Perrache). Le Tramway T1 ou T4 vous amène jusqu'aux portes de l'INSA, à l'arrêt "La Doua Gaston Berger" ou à l'arrêt "INSA Einstein" (T1 uniquement).
Hôtels proches du Campus | Hôtels proches de la gare Part-Dieu | Auberges de Jeunesse |
---|---|---|
Hôtel IBIS STYLES 130 Boulevard du 11 Novembre 1918 69100 Villeurbanne tel: +33 (0)4 78 89 95 95 Prix entre 70 et 100€ par nuit, petit-déjeuner inclus |
Campanile Lyon Part-Dieu 31 rue Maurice Flandin 69003 Lyon tel: +33 (0)4 72 36 31 00 Prix entre 75 et 86€ par nuit, hors petit-déjeuner |
Auberge de Jeunesse Le Flâneur Guesthouse 56, rue Sébastien Gryphe 69007 Lyon tel: +33 (0)9 81 99 16 97 |
Hôtel des Congrès Place du Commandant Rivière 69100 Villeurbanne tel: +33 (0)4 72 69 16 16 Prix entre 124 et 144€ par nuit, petit-déjeuner inclus |
Hôtel IBIS STYLES 54 Rue de la Villette 69003 Lyon tel: +33 (0)4 72 68 25 40 Prix entre 70 et 100€ par nuit, petit-déjeuner inclus |
Auberge de Jeunesse Away Hostel and Coffeeshop 21 Rue d'Alsace-Lorraine 69001 Lyon tel: +33 (0)4 78 98 53 20 |
Hôtel Mercure Charpennes 7 place Charles Hernu 69100 Villeurbanne tel: +33 (0)4 88 70 08 65 Prix entre 111 et 138€ par nuit, petit-déjeuner inclus |
Hôtel Mercure 50 Rue de la Villette 69003 Lyon tel: +33 (0)4 72 68 25 20 Environ 149€ par nuit, hors petit-déjeuner |
Hôtel proche du centre-ville |
Hôtel des Célestins 4 rue des Archers 69002 Lyon tel: +33 (0)4 72 56 08 98 Prix entre 110€ et 140€ par nuit |