Blog Archives

Big Data, Scarce Data: Which One Fits Medicine?


While visiting CERN last spring, there was a catch phrase used during the visit that stick in my mind. For the ATLAS detector, at the heart of one of the 4 main experiments at CERN and also one of the experiment that found experimental evidence for the Higgs boson (or a Higgs boson…), the interesting data were the equivalent of a 100 megapixels camera taking 400 photos per seconds (or maybe the other way around, but it does not change the shear scale of things)!

This amount of data is after all kinds of real-time software and hardware processing because the raw data during normal operation (read beam on condition) is close to 1 PetaBytes/sec (MB, GB=1000MB, TB=1000GB and finally PB=1000TB)…and this is only for ATLAS. In fact everything about the Large Hadron Collider (LHC) is big, from cost, to equipment, to human resources and data generated. Nature had an interesting article about how the data are handle and distributed worldwide among the collaborators.


Now what about medicine? We hear a lot about big data in biological sciences and medicine. The main problem, at least in medicine in my opinion, is not that there is too much data for the researchers and physicians but rather the other way around. Database for clinical trials conducted at various levels (from internal trials at individual hospitals to more global trials) are not all, or at all(!), compatible with each other. Furthermore, numerous database tends to be incomplete not by design but simply from the difficulty of filling and ensuring data integrity. While big data also sounds great for personalize medicine, personalize medicine by definition means low numbers of very specific medical conditions. Overall, we are unfortunately at this point in time in a scarce data mode.

The next big step for big data in medicine is a revolution with regards to database management, sharing and analysis. And yes personalize medicine will likely mean bigger research consortium and more sharing of data. There is a lot to learn from the particle physics community and initiative like the LHC. I do hope that those big data grant programs we are seeing in our country is to address that in priority. Until then, we will remain with incomplete or scarce data in medicine.

Nobel Prize Week: Physics Nobel Prize for the Higgs Boson!

This is this time of the year again when Nobel prize winners are announced. As expected, the physics one goes for the Higgs boson following the experimental confirmation by CERN. More precisely the Prize is given to Englert and Higgs.

Note added: Physical Review Letters announced that the 1964 articles from the nobel winners are now available for free…Physics Letters B did something similar regarding the experimental papers from ATLAS and CMS during the summer of 2012. So all in all, the four key papers pertaining to then Higgs boson (at least for now) are available for anyone with internet access to consult!

Observation of a new particle at LHC: Booklet / Journal issue

Not that I am promoting any buying of “derivatives” from the discovery of the Higgs boson but there is some interesting (free) images and free access to a PDF booklet, which includes the two published articles in Physics Letters B by ATLAS and CMS:

Elsevier Webshop.

Worth a look 😉

The CERN LHC in 34 pictures

In the wake of this week Higgs discovery announcement, here is an amazing sets of 34 photographs of the LHC and the detector apparatus at CERN from The Atlantic: In Focus – The Fantastic Machine That Found the Higgs Boson – The Atlantic.

Simply fantastic!

%d bloggers like this: