Submitted projects

Title Project description Project duration Contact for further details Status
Distributed storage systems for big data

The group maintains a framework called dmlite which is used to integrate various types of storage with different protocol frontends. It is the basis of a number of the group’s products such as the Disk Pool Manager (DPM), a grid storage system which holds over 50PB of storage in the ...

3 to 9 months depending on the selected task Submitted
Performance optimization in a High Throughput Computing environment

Profiling computing resources with respect to WLCG experiment workloads is crucial for selecting the most effective resources and optimising their usage.
A wealth of data is collected by the CERN and WLCG monitoring infrastructures, just waiting to be turned into useful ...

6 to 12 months Submitted
Cloud data analysis

Cloud synchronization and sharing is a promising area for the preparation of powerful transformative data services. 

The goal of this project is to prepare CERNBOX to be used in connection with heavy-duty activities (large-scale batch processing) on the current LXBATCH infrastructure (LSF) and on its future evolution (HT-Condor): physicists can ...

6-12 months Submitted
Dynamic storage federations

The group runs a project whose goal is the dynamic federation of HTTP-based storage systems, allowing a set of globally distributed resources to be integrated and appear via a single entry point. The task is to work on the development of this project (“dynafed”), implementing functional and performance extensions,
  • ...
From 3 to 9 months depending on the selected task Submitted
File Transfer Service (FTS) extensions

The File Transfer Service (FTS) manages the global distribution of LHC data, moving multiple petabytes per month during a run and underpinning the whole data lifecycle. Join the FTS team in their development of this critical service. Possible projects include

  • authorised proxy sharing: allowing a production service to delegate
  • ...
From 3 months, depending on task selected Submitted
QA in distributed cloud architecture: evolution of smashbox framework

Cloud synchronization and sharing is an evolving area, with innovative services being built on top of different platforms. CERNBOX is a service run at CERN that provides both synchronisation services (based on the OwnCloud software) and high-performance data access and sharing (based on EOS, the CERN ...

3-12 months depending on the agreed scope Submitted
QA in distributed cloud architecture: injection-fault testing

Clients of the sync&share system (CERNBOX) are particularly exposed to "operational failures" due to heterogeneity of hardware, OS and network environments. 

The sync&share system operates in a very heterogeneous network environment: from the fast, reliable network inside the computing centre to unreliable, high-latency ad hoc connections, such as those from airports.
Windows filesystems ...

6 months Submitted
Advanced Notifications for WAN Incidents

One of the main challenges in WLCG WAN networking is network diagnostics and advanced notification of issues seen in the network. LHCOPN/LHCONE, the core global networks in WLCG, have more than 5000 active links between 120 sites. Currently, most of the issues are only visible to the applications ...

12 months Submitted
Optimisation of experiment workflows in the Worldwide LHC Computing Grid

The LHC experiments perform the vast majority of the data processing and analysis on the Worldwide LHC Computing Grid (WLCG), which provides a globally distributed infrastructure with more than 500k cores to analyse the tens of PB of data collected each year. Profiling of the computing infrastructure with respect to the ...

3 to 6 months Submitted
Analysis of the I/O performance of LHC computing jobs at the CERN computing centre

The LHC experiments execute a significant fraction of their data reconstruction, simulation and analysis on the CERN computing batch resources. One of the most important features of these data processing jobs is their I/O pattern in accessing the local storage system, EOS, which is based on the xrootd protocol. In ...

3 to 6 months Submitted
e-learning - IT Collaboration, Devices & Applications - Indico Usability study

Indico is an open source web application for event organization, archival and collaboration. It is developed at CERN and evolves in the IT Collaboration, Devices & Applications (IT/CDA) group.

The application is used by tens of thousands of users around the world and across projects, universities, laboratories and UN ...

6 months 2 days/week, often from home Maria Dimou Submitted
e-learning - IT Collaboration, Devices & Applications - Insert subtitles in video tutorials

Use a free tool to convert existing plaintext files, containing the exact script of our short online e-learning videos, into .vtt files, in view of introducing subtitles.


  1. Click on each Indico event in the list below. Each event contains a link to the recording, with the script as an attached file.
  2. ...
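The conversion described above could be sketched as follows. This is a minimal illustration, not the tool the project refers to: the function names `format_timestamp` and `script_to_vtt` are hypothetical, and because a plaintext script carries no timing information, the sketch assumes each line of the script receives a share of the video duration proportional to its length.

```python
def format_timestamp(seconds: float) -> str:
    """Render a time in seconds as a WebVTT timestamp (hh:mm:ss.ttt)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def script_to_vtt(script: str, duration_s: float) -> str:
    """Turn a plaintext script into WebVTT text, one cue per non-empty line.

    Assumption: the script has no timing information, so each cue gets a
    share of the video duration proportional to its character count.
    """
    lines = [line.strip() for line in script.splitlines() if line.strip()]
    total_chars = sum(len(line) for line in lines)
    parts = ["WEBVTT", ""]  # mandatory header followed by a blank line
    elapsed_chars = 0
    for line in lines:
        start = duration_s * elapsed_chars / total_chars
        elapsed_chars += len(line)
        end = duration_s * elapsed_chars / total_chars
        parts += [f"{format_timestamp(start)} --> {format_timestamp(end)}", line, ""]
    return "\n".join(parts)


if __name__ == "__main__":
    # Hypothetical two-line script for a 10-second video.
    print(script_to_vtt("Hello\nWorld", duration_s=10))
```

The estimated cue times would still need to be checked against the recording and adjusted by hand where the speaker pauses or speeds up.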
1 month Maria Dimou Submitted, Accomplished
MAlt-related project: Standard documentation workflow and conversion tools for documentation, slides etc.

To standardise the way service web sites (both user-facing and administrator pages) are handled across the group and department, IT/CDA proposes a standard way to create, maintain and serve service web sites, based on modern and open technologies (Markdown, GitLab and OpenShift). This is now documented ...

1 year Maria Dimou Submitted
MAlt: Preparing the new CERN telephony service

The CERN telephony landscape is going to experience major changes in the coming two years with a new fully open IP telephony system currently in development in IT.

The student will be involved in various aspects of the preparation of this new service: adapting the existing provisioning interfaces and processes ...

1 to 2 years Submitted
MAlt-related project: New documentation testing for up-to-dateness and functionality

The Collaboration, Devices and Applications (CDA) group in the CERN IT Department provides a large number of services that are highly visible to end users. For a complete list of services, please check our internal site.

These services are documented in many pages written in TWiki, SharePoint, Drupal or Markdown. ...

Up to 6 months at 50% working time (20 hours per week) Maria Dimou Submitted
MAlt-related project: CDA Jekyll site finalisation

The site is useful, as the usability tests showed. A summer student could finalise it by:

  1. Replacing the bubbles on the page with another view, equally attractive but displaying properly on mobile devices.
  2. Writing the script that synchronises the short description in each service with the relevant SNow
  3. ...
2 or 3 months Maria Dimou Submitted
