Conference papers

Data management in EGEE

Abstract : Data management is one of the cornerstones of the distributed production computing environment that the EGEE project aims to provide for an e-Science infrastructure. We have designed and implemented a set of services and client components that address the diverse requirements of all user communities. The LHC experiments, as the main users, will generate and distribute approximately 15 PB of data per year worldwide using this infrastructure. Another key user community, biomedical projects, has strict security requirements with less emphasis on data volume. We maintain three service groups for grid data management: the Disk Pool Manager (DPM) Storage Element (with more than 100 instances deployed worldwide), the LCG File Catalogue (LFC), and the File Transfer Service (FTS), which sustains an aggregate transfer rate of 1.5 GB/s. They are complemented by individual client components and by tools that help coordinate more complex use cases involving multiple services (GFAL-client, lcg_util, eds-cli). In this paper we show how these services, which keep clean and standard interfaces among each other, work together to cover the data flow, and how they can be used as individual components to meet diverse requirements. We also describe areas that we are considering for further improvement, in both performance and functionality.
Contributor : Sabine Starita
Submitted on : Tuesday, May 18, 2010 - 11:17:03 AM
Last modification on : Wednesday, September 16, 2020 - 4:23:40 PM

Á. Frohner, J.-P. Baud, R. M. Garcia Rioja, G. Grosdidier, R. Mollon, et al. Data management in EGEE. 17th International Conference on Computing in High Energy and Nuclear Physics (CHEP'09), Mar 2009, Prague, Czech Republic. pp. 062012, ⟨10.1088/1742-6596/219/6/062012⟩. ⟨in2p3-00484260⟩


