survivor bias, art of data analysissurvivor bias, art of data analysis
Spread the love

Loading

We often hear that data analysts and data scientists should be proficient at listening to data and the art of storytelling with data, but what does that mean?

We can learn a lot about the art of data analysis and critical thinking involved from Abraham Wald’s wartime analytics story.

Abraham Wald was reviewing damaged aircraft returning from Germany in the Second World War to find out how to better protect fighter aircraft. Abraham found that the fuselage and fuel system of returned planes are much more likely to be damaged by bullets or flak than the engines.

Considering this information, the question was: 

What sort of protective measures should he recommend to his superiors?

Red dots showing areas where aircrafts are most likely to be hit by enemy fire
Figure : Red dots showing areas where aircrafts are most likely to be hit by enemy fire

The obvious answer to the DoD and MoD was to protect the parts of the plane that were most heavily damaged and that’s what most military experts were recommending. However, much to the amazement of the military, Wald recommended the armor should go where there was no anti-aircraft damage.

Wald’s brilliance suggested an unconventional solution which saved countless lives.

Solution: Don’t armor the places that sustained the most damage on planes that came back simply because they came back and these areas can sustain damage!

Circle showing areas where armor should be applied to save the aircraft
Figure: Circle showing areas where armor should be applied to save the aircraft

These bombers’ wounds showed where they could afford to be hit. Said another way, the undamaged areas on the survivors showed where the lost planes must have been hit because the planes hit in those areas did not return from their missions.

Related

Introducing Data Science Gym – Technology Magazine (tech-mags.com)

By Hassan Amin

Dr. Syed Hassan Amin has done Ph.D. in Computer Science from Imperial College London, United Kingdom and MS in Computer System Engineering from GIKI, Pakistan. During PhD, he has worked on Image Processing, Computer Vision, and Machine Learning. He has done research and development in many areas including Urdu and local language Optical Character Recognition, Retail Analysis, Affiliate Marketing, Fraud Prediction, 3D reconstruction of face images from 2D images, and Retinal Image analysis in addition to other areas.