Université de Paris

45, rue des Saints Pères
75270 Paris cedex 06

E-Mail: Soror . Sahri at_parisdescartes.fr
Phone: +33 1 83 94 57 91




I am an Associate Professor at Université de Paris and a member of the Data Intensive and Knowledge Oriented Systems group (diNo) of LIPADE (Laboratoire d’Informatique Paris Descartes). I received my PhD in computer science from Paris-Dauphine University, where I worked on scalable and distributed databases.



Research interests


My research interests cover large-scale data management and data quality.


The first focus of my research is to propose prototypes and proofs of concept for managing very large volumes of data in a distributed manner, considering data storage, modeling, and query processing. We have recently focused on managing uncertain data in distributed environments, and have proposed efficient and effective query processing methods that leverage distributed indexing techniques.


The second focus is data quality in a large-scale context, particularly deduplication. As part of our methodology to ensure data quality, we use data quality rules and recent paradigms (e.g., crowdsourcing and linked open data). We recently proposed a framework that automatically detects duplicates by applying a newly introduced class of data quality rules, derived from existing rules in the literature such as conditional functional dependencies and matching dependencies.


My current focus is to propose frameworks for assessing the quality of Big Data in terms of its veracity and value.


Publications




R. Moussa, S. Sahri. Customized Eager-Lazy Data Cleansing for Satisfactory Big Data Veracity.
IDEAS 2021: 25th International Database Engineering & Applications Symposium. July 2021, pp 157-165.

A. Benaissa, S. Sahri, M. Ouziri. Top-k Queries over Distributed Uncertain Categorical Data. Trans. Large Scale Data Knowl. Centered Syst. 43: 40-61, 2020.

Abidi. L, Azzag. H, Benbernou. S, Bentounsi. M, Cérin. C, Duong. T, Garteiser. P, Lebbah. M, Ouziri. M, Sahri. S, Smadja. M. A Big Data Platform for Enhancing Life Imaging Activities. In book: Utilizing Big Data Paradigms for Business Intelligence. Chapter: 2. Publisher: IGI Global. Editors: J. Darmont. Pages 39-71, 2018.


Moussa. R, Cuzzocrea. A, Sahri. S. Big Data Management and Processing, chapter 9: SQL-on-Hadoop Systems: State-of-the-Art Exploration, Models, Performances, Issues, and Recommendations. CRC Press, Taylor & Francis, 2017.


Abourra. A, Sahri. S, Baba-Hamed. L, Ouziri. M, Benbernou. S. Quality-based Online Data Reconciliation. ACM Transactions on Internet Technology, Volume 16, Issue 1, February 2016.


Abourra. A, Sahri. S, Ouziri. M, Benbernou. S: CrowdMD: Crowdsourcing-based approach for deduplication. Data quality Issues Workshop in conjunction with IEEE International Conference on Big Data, 2621-2627. Santa Clara, 2015.


Benaissa. A, Benbernou. S, Ouziri. M, Sahri. S. Indexing uncertain categorical data over distributed environment. Conference of the International Fuzzy Systems Association and the European Society for Fuzzy Logic and Technology (IFSA-EUSFLAT-15), Gijon, Spain. 2015.


I. Benamor, M. Ouziri, S. Sahri, N. Karam. Be a Collaborator and a Competitor in Crowdsourcing System, IEEE 22nd International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), 2014, Paris.


Sahri. S, Ouziri. M, Benbernou. S. Summary-based Pattern Tableaux Generation for Conditional Functional Dependencies in Distributed Data. In: 25th International Conference on Database and Expert Systems Applications (DEXA), Munich, Germany, 2014.


Sahri. S, Moussa. R, Long. D, Benbernou. S. DBaaS-Expert: A Recommender for the Selection of the Right Cloud Database. In: 21st International Symposium on Methodologies for Intelligent Systems (ISMIS), Roskilde, Denmark, 2014.


Sahri. S, Schwarz. T. Design Issues of Shingled Write Disk for Database Table Implementation. Journal of Computers, Academy Publishers, ISSN 1796-203X, 2014.


Mokhtari, K., Benbernou, S., Sahri, S., Andrikopoulos, V., Leymann, F., Hacid, M. Timed Privacy-Aware Business Protocols. International Journal of Cooperative Information Systems, vol. 21 (02), pp. 85-109, 2012.


Yakouben, H. & Sahri, S. LH*RS P2P: a High Available and Distributed P2P Data Structure. Int. J. of Internet Technology and Secured Transactions. Volume 2, Number 1-2 /2010. Pages 5-31.


Sahri, S., Litwin, W. & Schwarz, Th. Scalable Web Services Interface for SD-SQL Server. The 3rd International ICST Conference on Scalable Information Systems. Infoscale 2008.


Litwin, W., Sahri, S. & Schwarz, Th. New Features for a Scalable Distributed Databases Management in SD-SQL Server 2006. Gong Show in The 3rd Biennial Conference on Innovative Data Systems Research, CIDR 2007, January 7-10, Asilomar.


Sahri, S., Litwin, W. & Schwarz, Th. An Overview of a Scalable Distributed Database System SD-SQL Server. In: Flexible and Efficient Information Handling: 23rd British National Conference on Databases, BNCOD 2006, Belfast, Northern Ireland, UK, July 2006, Proceedings, Bell, D. and Hong, J. (Eds.), Lecture Notes in Computer Science 4942, Springer-Verlag, Berlin, Heidelberg, and New York, 2006, pp. 16-35.


Sahri, S., Litwin, W. & Schwarz, Th. Architecture and Interface of Scalable Distributed Database System SD-SQL Server. The Intl. Ass. of Science and Technology for Development Conf. on Databases and Applications, IASTED-DBA 2006, Innsbruck. 


Sahri, S. Design of Scalable Distributed Databases. The 2nd IEEE International Conference on Information & Communication Technologies: from Theory to Applications, ICTTA, April 2006, Damascus, published in Information and Communication Technologies, 2006, 2nd Volume, pp. 2918-2919.
ISBN: 0-7803-9521-2.


Litwin, W., Sahri, S. & Schwarz, Th. Scalable Command Processing in SD-SQL Server: a Scalable Distributed Database System. 7th Intl. Workshop on Distributed Data and Structures (WDAS-7) accepted in SIGMOD Record, Santa Clara, CA, 2006, Carleton Scientific (publ.).


Sahri, S. SD-SQL Server: a Scalable Distributed Database System. The Dutch Belgian Database Day workshop, DBDBD 2005, Amsterdam.


Litwin, W. & Sahri, S. Implementing SD-SQL Server: a Scalable Distributed Database System. Intl. Workshop on Distributed Data and Structures, WDAS 2004, Lausanne, Carleton Scientific (publ.). 




Scientific talks

Big datasets quality. Talk within the context of ASGARD program, NTNU, Gjøvik, Norway, Sept 2019.


CrowdIDV platform. Workshop at the Cloud IDV micro-school, April 2018, Paris.


Data quality in distributed databases. Keynote talk, LCSI, Ecole Nationale Supérieure d’Informatique, Algiers, Dec 2016.

Research data management, Symposium IDV- Imageries du Vivant, Jan 2016, Cap Hornu, France.


From Big Data in medical imaging to semantics. “Regards Croisés sur l'Imagerie du Vivant, Concepts et Langages Interdisciplinaires”, IDV Program, Nov 2015, Paris.

Privacy-Preserving Business Process Fragmentation. 2nd Workshop of LIPADE (Paris Descartes Univ.), June 2012.

SD-SQL Server, a Scalable Distributed Database System, Laboratoire de Systèmes d'Information Répartis, EPFL, Lausanne, Oct 2007.

New Features for a Scalable Distributed Database System (with Prof. Litwin), Microsoft Research, Redmond, June 2007.

Structures de Données Scalables et Distribuées dans SD-SQL Server et SDDS 2005, Journées Académiques Microsoft, Apr 2006, Paris.


Teaching



·     Information System Modeling and Analysis (license 3)

·     Advanced Databases (license 3, Master 1 CS, Master Miage)

·     Relational databases (license 2)

·     Big data management systems (Master 1 Miage)

·     Algorithms (L2)

·     Number Systems and Logic (L1)

·     Relational databases (L2)

·     C2I (L1)


Other duties

·     Director of studies for the master’s degree in Computer Science and Management (Miage), 2013-2017.

·     Assessment and advising member for the undergraduate and graduate degrees in Computer Science, 2015-2017.

·     Appointed member of the laboratory council (conseil de laboratoire) since 2017.

·     Appointed member of the department council (conseil de l’UFR) since 2021.