The Anatomy of Citizen Cyberscience

by Suw on September 2, 2010

?Why do people get involved in citizen science?

Becky Parker

When I was young, and watching the RI Christmas Lectures, was inspired by Carl Sagan, where he went and had tea on Mars. Can remember where I was sitting, what he said, and thought that would be amazing! Moved from girls’ school to comprehensive, and was only girl doing double maths, physics and chemistry. Was so interested in astronomy, went to Norman Lockyer observatory, and go look at the stars. Ended up in physics. Had Tony Legget, Nobel Lauriate, and to courses on foundation of quantum mechanics. Thought, this is so amazing, why don’t students love it? Went into teaching, and want to inspire students? Very lucky, have a good school, head that supports projects.

 

Julia Wilkinson

Gave up science at school, girls weren’t encouraged to do it, and has regretted it ever since. Apollo missions inspired her to get into astronomy. Passion for astronomy has lasted all her life, got back into it 10 years ago when bought a telescope. Three years ago, was looking for a way to get more involved, saw Stardust@Home, and thought, I can contribute to real science. A few weeks later, found Galaxy Zoo, and that was better as observed galaxies through telescope, so this was what she wanted to be involved in. On the back of this, now studying science with the Open University. Has experience with volunteers, and has noticed a lot of overlap in terms of way that citizen science works with volunteers. See same patterns of behaviour. Voluntary sector, constant need to motivate volunteers, lots of challenges, feedback etc. That’s what cyberscience does, but if you let volunteers know exactly what’s happening with the data, that increases morale.

 

?Richard Haselgrove

Interest first formed in childhood, parents both involved in early days of electronic computers, so grew up with them. Went to standard school career, university, and that’s as far as it went at that stage. Moved into the public sector. Left science and computing behind until arrival of personal computer, in around 1980. Could then start to experiment with computing for more general purposes. Now, linking of communities by technology taken for granted. Read in press about SETI@Home, reconnected with scientific interests. Computer a volunteer, but he wasn’t. Now he’s nearing retirement, is more able to volunteer himself as well. As people have more time to commit, volunteers do gain a lot of experience, what draws him further in is developing knowledge that he can pass on to arriving volunteers and to new projects. Can’t always get involved with the science behind a project, but can help with the project from a how you deal with volunteers, the platform, etc. We as volunteers have an impact on scientists, and have a lot of valuable insight to feed back into the projects.

 

Christian Behr (?)

Started with SETI@home, was doing internship in web development and someone there showed it to him. Have run it on every computer he’s had since then. Got interested in BOINC, and also the science, not just how they are searching for aliens, how the volunteers work together. Contribute not only computer power, but also knowledge in programming. Motivation to learn how to programme. Also wanted to give the knowledge away, but it’s not giving it away, it’s multiplying it. Social part is great motivation.

 

?Bruce Borden

Interested in similar stories that I’m hearing – we don’t know each other, but I have a lot of things in common with what’s already been said. Am a retired scientist, advanced degree in maths, worked for an engineering firm doing mathematical analysis. Concepts of maths and how to do simulations are comfortable. When he retired, asked same questions about what he was going to do with the rest of his life, how could he spread the knowledge he has. Had discovered SETI@home, thought it was an intriguing idea. A few years later, got interested in Folding@home, Standford University’s programme. Also influenced by previous volunteer work, spend two years when he was younger teaching maths as part of the Peace Corp. Important aspect now is that he’s hooked by the science. Also important is managing volunteers, keeping them enthused, and this is an area that is grossly neglected. Need to take care of volunteer’s feelings about what they are doing. Wide range of how we have to deal with volunteers based in part on their skill level, there is a wide range of people, need to deal with them in slightly different way. Skills I can deliver in addition to teaching about the science or maths, or computers, work with the forum primarily with an educational goal.

 

Ian Hewliss (?)

Qualifications are simply that he watched a TV programme about climate change, and invited viewer at the end to run some software. Thought it was easy, but others found it harder. Rang BBC climate change project and started helping people with technical problems. Sense of community builds up, because people feel what they are doing is relevant, and it references what other people are doing. He is a physics graduate, since then used that to model behaviour of satellites, so now does radio comms. Accidental cyber-scientist. What’s his motive? Hard one to answer. Word ‘citizen’ implies a community. Used to talking about ‘citizens’ in a political way, right and obligations, there are two communities in which citizen word is relevant – one is to ask, why do people participate? Also community of a particular project. Debate to be had about balance of rights and obligations of participants.

[Then followed a discussion, which I'm too tired to transcribe! Still, interesting stuff.]

 

 

{ 0 comments }

Enabling school students to do real science via CERN technology. Want to show kids that they can be involved in what is going on now, that science is vibrant and interesting subject.

The Langton Star Centre is ?at Simon Langton Grammar School, Kent, gives student change to work alongside scientists and engineers. They go to CERN every year, and one year visited the lab of Dr Michael Campbell’s Medipix lab, working on a chip for the ALICE detector. Can be used for medical imaging. Lots of research and collaboration using this chip.

Competition for schools to design experiment to go into space. Wanted to do a cosmic ray intensity detector using Medipix chip – called LUCID, which ?will fly in 2012. Won, but project was a bit expensive so they have an earthbound version too.

Wanted to get more schools involved, which led to CERN@School, so different schools can look at cosmic rays in space and on earth via detector in their own lab. Pick up data and then examine it in the school lab, can do particle recognition.

Found it hard to get the money together, but got a pilot scheme in a ten other schools that take data at a set time each day. Schools pool the data, then it goes up on LSC servers and can see it on a map. But how to analyse it and get good science out of it? Now have a model using grid storage and computing. Will soon be able to do analysis of tracks.

Next step would be linking up with other cosmic ray projects.

Expanded project would enable sophisticated analysis and potentially useful result. If had enough schools, would have an enormous network of detectors, might be able to discover particles above GZK limit.

CERN@School invigorates teachers as well as inspiring students. Hope to attract more scientists into schools. Doing real science, real analysis, is not only fantastic, it also shows how smart and capable these students are.

{ 0 comments }

Herbaria@home, herbarium records, snapshot of the world before agriculture, including areas now completely obliterated. Found plant, Ghost Orchid, thought to be extinct.

Plant don’t move, but they do invade, e.g. Oxford Ragwort. Scilian plant introduced in 1700 to an Oxford botanical garden, escaped, and now has spread out across UK. Roesbay willowherb, but railways have distributed seeds and now it’s everywhere. Plants also go by road, e.g. Danish scurvy grass, should be coastal, but now has colonised verges.

Plant populations are in flux. Modern survey data alone isn’t enough, so need the historic data to give context.

Web based project to catalogue old data. Collection of 50,000 documents he is working on, several million UK-wide, but even with willing volunteers (in person) there are too many records.

Online, Wikipedia established that people would do this sort of online work, you can allow open access editing and it wouldn’t be mayhem. Distributed Proofreaders showed that people will transcribe text from the internet.

Have taken photos of documents and put them online along with a form that people can fill out to say what they see in the label data around the specimen image, e.g. site names, collectors, date.

Some of the documents are quite clear, because they are printed, but there are a lot of hand-written documents, e.g. from 1859, and the handwriting poses quite a problem. Handwriting recognition may eventually get there but it is quite a long way off.

Once you have the data, can give it a grid reference and put it on a map. Can validate a lot of the data as they enter it. Need volunteers to collaborate and discuss what they see, so have active message boards. Have a pretty expert volunteer set, e.g. with plant recognition especially of rare plants.

Have worked on collections from several large universities and museums, and often they don’t have a full time curator for these collections so that data is inaccessible otherwise.

Peer-revuew of records, people have free access to edit anything, and public edit history for every record. Botanists able to spot errors and make changes, but a lot of non-professional botanical expertise, people keen to work on a project like this.

Some similar collections likely to come online in similar projects soon, e.g. insects.

Benefits: improve access to collections, raise profile of collections, and people enjoy it as a hobby.

{ 0 comments }

Einstein@Home is a traditional citizen science project. Have 100,000 computers at any one time contacting project and looking for work. People join the project by joining website, download & install software, and then leave it alone. Get a screensaver (which is very pretty!), and when their computer is idle it is analysing data.

Physics experiment data. Not simulating, but taking real data about physical world and searching for very weak signals that reveal neutron stars – very compact, small start, 10km radius, which beams radio waves like a lighthouse. As beam passes by Earth you see a flash. Forms when an ordinary star burns all its fuel and collapses under gravity, electrons get crushed into the nucleus, combine with protons to form neutrons, which are 100x smaller than the original atom. They spin very quickly for same reason an ice skater spins faster when they pull in their arms.

Example, Crab Pulsar, formed 1054AD, spins 33 times per second. About 100m neutron stars in galaxy, but have found about 1900, mostly near us.

Einstein@Home uses gravitational waves to search for neutron stars sent out by the star. Detectors, built in last 20 years, made of mirrors hanging from wires, and when a gravitational wave comes along the mirrors swing a bit. That can be detected.

Also use data from Arecibo radio telescope in Puerto Rico. First discovery: 11 July 2010. Signal was followed up the next day and reconfirmed quickly.

Found a second radio pulsar, currently unpublished, appears to be a binary system, but not yet clear what the masses of the stars are.

Publicity of first discovery has been very inspiring for users and project team. New users jumped when the publicity happened, and the number of users leaving the software running continues increase.

Square Kilometer Array, which will come online in 2019 or later will produce so much data that distributed computing may be the only way to process it.

{ 0 comments }

 

Most citizen computing projects can do no wrong. That does not apply to climate science. ClimatePrediction.net raises a few hackles, often in situations such as a recent meeting about investment in supercomputer, and when someone said, “I hear you can do a lot of this on PS3s nowadays?” there was a degree of hostility.

It’s just a way of addressing certain problems. People think of models as being done on a petaFLOP Cray XT-6, but most climate scientists don’t have access to these. They can have access to citizen scientists.

Climate modelling depends on:

  • Complexity, e.g. number of processes, number of aspects of the sytem
  • Resolution of your model, e.g. 100km scale, 10km scale or 1km scale
  • Duration of the run
  • Ensemble size, (groups of models) and this is where citizen cyberscience comes in

Often need to run models may times, and this is where ensembles come in.

Uncertainty in models varies. Uncertainty was felt to be underreported, so added subjective assessment of uncertainty, some of numbers are a little bit rounded, as they were decided on through discussion.

Suspected the model ranges were too small was because all the models matched the 20th century numbers ‘suspiciously well’. Need unrealistic models as well as realistic ones. You have to go outside the range that is fits perfectly in order to be sure you know what that range of forecasts are consistent with current observations.

Serious money to do a run, so they are looking for good models, not ‘bad’ ones.

By doing tests of different models (using citizen science), see that there -40 error bar was too pessimistic in terms of uncertainty, and the +60 was about right.

Learnt that the lower bound too low, upper bound about right, but this was through experimentation, not discussion. Therefore is testable.

What next? Using volunteer computing to see how extreme weather and climate are related, as global warming can cause both extreme hot and cold weather events, e.g. heatwave in Russian, Pakistani floods. Were they one event or two? Where they related to global warming?

Looking at the flooding in UK in 2003, simulating seasons where damaging weather events occur, both with and without the signature of climate change, to see if it had an effect. Looking for influence of external driver – human influence.

These are rare events, so have to model them many times to see if risk of extreme event has increased.

Projects in development, embedding regional models in simulations.

Have only used participants to provide compute power, so haven’t engaged participants brains. Big challenge faced is that only a few hundred people take part.

 

{ 0 comments }

Comp scientist at UC Berkley, building platform for citizen science. Looking for commonalities, software support that addresses community’s needs to make it easier for scientists to use volunteer power. Tech is only one piece of the solution.

Build platforms for:

  • Volunteer computing
  • Distributed thinking
  • Education

Computational science:

Simulations are now so vastly complex that they can only be done on computer. Simulations at various scales,  e.g. proteins, ecosystems, Earth, galaxy, universe. . Need lots of computing power because need to fit models to observed data. To predict what’s going to happen you need to run thousands or millions of simulations.

Generation of new instruments, e.g. LHC, LIGO, SKA, gene sequencers, produce data at unprecedented rates, right at limit of computers to handle. Beyond limit of computers owned by unis or institutions. Science limited by computing power and storage capacity. What we need is not a faster computer, but higher throughput, i.e. a lot of computers.

Consumer digital appliances, e.g. computers, handhelds, set-top boxes, are all converging on similar hardware. Networks that connect them all: consuemr digital infrastructure. 1.5 billion PCs. Graphics processing improved through desire to watch HD TV and play realistic games, and GPU oft 100x CPU speed. Put there for games, but good for science.

Storage on consumer devices approaching the terabyte scale, network approaching 1 Gbps.

All this is ideal for science computing!

Compare consumer digital infrastructure with institutional counterpart, it’s way bigger and way cheaper than institutional computing. Supercomputers moving towards an ExaFLOPs in 5 years, but consumers already have 1000 ExaFLOPS today. Consumer spend $1 trillion per year!

BOINC, free open source software, anyone can create a project.

Utopian ideal, to have a lot of these projects, getting computing power by advertising research to the public, educating public on their project, so public supplies resources to science where they want to put it.

Boinc projects

  • ~30 projects
  • 300k vols
  • 530k computers
  • 3 PetaFLOPS

Volunteers can do more than run software. they can provide tech support, can optimise programme, translate website, recruit new users. Initially used message boards, realised that they weren’t working well for non-tech systems, so now have a system based on Skype, so people needing help can find someone who is willing to give that help via Skype.

Volunteers have a spectrum of confidence, and some users are malicious, e.g. they have had people trying to scam others users and trying to get their PayPal IDs.

Motivations study. People interested in doing science, want to show the world they have the fastest computer, people who want to be a part of a team.

Distributed thinking. Stardust@Home, interstellar dust photos, looking for grains of dust. People can do this better than computers. Interesting thing was needing to quantify accuracy of results. Created samples where they new the answers, i.e. either contained noise or had a particle. Every 5th image was a callibration images and so could keep track of false positives and negatives.

Also used replication, many people look at it, and then if there is consensus can look at calibration results and that shows if consensus is correct. Project found all the dust particles it could.

Created platform called Bossa. Middleware for distributed thinking, provides scheduling mechanisms, e.g. calibration jobs, replication. Open system with respect to assessment, scheduling policies.

Being used to find fossils.

Also extending Bossa to Bossa Nova, looking at more complex systems for asking people to do things involving creativity, problems for which there is no unique answer. E.g. complex problem solving, use volunteers to decompose problem into sub-problems, propose solutions, evaluate them, evaluate how a group of solutions might work together. Involves different skills. At software level, it uses people optimally, uses them for tasks to which they can contribute most.

Education and citizen science. If we can train people to do more complex staves we can achieve more. This is very important – if people learn more they may stick with a project longer, recruit more people to help, and you get more computing power.

Challenge of training or educating 100,000s people is a challenging problem, not attacked by traditional education theory. Heterogeneity is problematic: different backgrounds, education levels, locations, language. What makes this tractable is that there is a constant stream of students arriving, dozens, 100s, 1000s new users per day, so we have lots of people arriving interested in a course, so we can do experiments. If we haev two alternative ways of teaching a concept, we can rig up the software system to randomly show one lesson or the other and then they take the same assessment.

May learn one lesson is better than another, or one lesson is better for a subset, e.g. based on demographic or other attribute, and can then make an adaptive course where as we learn more about the student we refine how we teach them. Not just individual lessons, but overall structure of course.

Bolt: system for tailored education for large streams of volunteers.

{ 0 comments }

Where’s George? is a project tracking one dollar notes in the US. Put a stamp on the note, and log it on the website. Creates link between the two places where the not was logged. Provides a lot of data about how money moves around the US, therefore data about how people move around the US.

Administrative system of US: Divisions contain States contain Counties. Spatially compact hierarchical structure historically evolved with geographic determinants.

Human mobility – are these divisions spatially compact, determined by geography? How much geography is encoded in the network of money movements?

Can make groups of counties, run algorithms to test the strength of borders between devisions/states/counties. Shows Mississippi River is a strong border, as is Colorado/Ohio border.

Compare to epidemiology, SIR model – susceptible, infectious, recovered, i.e. what happens when an infectious person meets a susceptible person, etc. No spatial element in this model. When modelled, see a wave of infection moving through the world.

But modern transportation, aviation, changes it. Incorporate that, creates new model of spread of disease worldwide, e.g. SARS. Modelling based on country doesn’t provide enough granularity. Using Where’s George? data, can see how people move around the US.

No dataset like WG. Local travel data and aviation data, but doesn’t show full picture.

Nw model for Swine Flu, combine WG data with for spread, and show where possible flu hotspots might be, e.g. LA, Dallas, Miami, Chicago, NY areas.

Can look at multiscale mobility and local mobility, they show very different spreads of disease.

{ 0 comments }

I’m here at the Citizen Cyberscience Summit for the next two days. Expect quite a few notes (although probably not all session!).

Lessons in crowdsourcing.

Nothing new under the sun. Science shaped by forces that have existed a long time, and we understand them. Works on calculation and making mathematical models work. That has long history as being citizen science, going back 200 years. Take large task, divide into small exchangeable jobs, send out into the world with instructions on how to do them. Charles Babbage wrote extensively about this in the 1830s.

Babbage was thinking about this because of his computing machine, how can you split big tasks up? Activity is largely about starting something, drive is towards radical organisation that moves towards convention, where you have people who lead, follow, varying levels of skill.

Babbage’s discussion, prime example he worked off, Gaspard DeProny, French Revolution, labour was very cheap. DeProny got 100 people to do maths tables, required for surveying.

~100 years later, 1875, post American Civil War, lots of people out of work, women who were widows, organised group of computers, i.e. women, who put together the Harvard Star Catalogue. Then again in 1907 by US Naval Obs.

And again in 1938 during the Great Depression – 450 people working at tables with paper and pencils doing calculations for scientists or government. Maths Tables Project.

A few people have adding machines, they are the leaders.

A NYC computing office, Columbia Uni Stats Computing Lab, 1930, had 6 employees.

Most of the Maths tables computers hadn’t been to high school, organised by arithmetic opersations e.g. – or +.

Often drew from poor classes at that time: blacks, women, Irish, Jews.

Planners had a doctorate or masters, operated 1938 – 1948, remnants existed til 1964.

Most scientists/engineers didn’t have access to computers until mid-60s when they had timeshare. Some not until the 70s. They worked with a worksheet, that was planned, a bit like programming. Early programmers were called planners, coding was called planning. Instructions, so workers didn’t need to understand what they were doing.

Built 28 volumes of tables, e.g. powers of integers, exponential functions.

Discovered there were particular skills, and started to look for specific calculations that were good for generating revenue, e.g. OSRD calcs, microwave radar tables, explosion calcs; LORAN navigation tables (precursor of GPS); general science calcs, e.g. Hans Bethe paper on Sun. First test of linear programming.

Labour economics. As you build skills, people want to use their skills and want to be rewarded for that skills. They want to advance. WPA studied labour and skill as a way of building identity.

Building skill

  • Identity
  • Accomplishment
  • Avancement

Aspirational issues: Everyone wanted to be special.

Special Computing Group, had a room to themselves, had machines, almost all were women, and everyone wanted to move up to the special computing group. They started offering courses at lunch to develop those skills. Once they had completed that, were were opportunities outside. So lots of places they could apply those skills. Best measure of ability was the skill of the group.

They recognised that losing members of the group, breaking it up would damage the organisation. Started to have difficulty getting work, so soliciting work from scientists.

Gertrude Blanch, PhD, chief mathematician. Ran the computing group. Had to take everyone who came in the door, had to find ways to enforce discipline, e.g. people who didn’t think about carrying the ten, or who couldn’t concentration. Calculations were done 3 – 10 times each, didn’t just duplicate for sake of it.

“People doing hand calculations computing the same number the same way make the same mistakes” – Babbage

So did same calculation in different ways to ensure accuracy.

Crucial issue: How do citizen scientists relate to professionals? Professionals build walls around themselves.

National Academies of Science said they wanted to be of use to the gov’t, and help scientific work. WPA sponsored a lot of science. Internal comms of Nat Academies are ‘embarassing at best’. Group realised it was in a position it didn’t want to be in, and internal memos had one set of reasoning that repeats:

“Scientists are successful people. The poor, are not successful, because they are poor. Therefore can conclude that hte poor are not scientific. Ergo, maths tables project is not scientific, ergo their work is not good.”

NAS wanted the budget for the maths tables and wanted to do it “right” and ‘well”. But they could never have replicated it with students as budget wouldn’t cover it.

Handbook of Mathematical Functions: Largest selling science book in history.

Gertrude Blanch, finished PhD in 1934, was Jewish, was never going to be employed by anyone, but doing the maths tables project lead her to be employed by the Air Force, worked on supersonic air flow, jet nozzles.

The thing that we are doing is building skill amongst the general populous can never be overlooked.

Maths group had droped form 450 people to 120 as labour costs were higher. But claimed 120 was as efficient. by 1946, group had fallen to 60 people with specific skill, but still as efficient. People have titles, skills, identifiable expertise.

Lessons:

  • We are creating skill, not just exploring the universe and doing science,
  • Get people who want to be identified with project, part of their identity.
  • Builds org with hierarchy, divisions of labour based on skill
  • Encourage aspiration

DeProny users Adam Smith as justification, first 2 ch of Wealth of Nation, division of labour, identifying people with skill. Forces that shape these orgs and relationships that will support science, there is also a political economy that shapes it. which builds skill but divides jobs, creates leaders and followers. Must deal with science self-defining as enclosed domain.

 

{ 0 comments }

I’m often asked at conferences and by journalism educators what skills journalists need to work effectively in a digital environment. Journalism educator Mindy McAdams has started a nice list of some of these skills in a recent blog post. A lot of journalists (and journalism educators) scratch their heads over what seem an ever-expanding list of skills they need to do digital. It feels like inexorable mission creep.

I can empathise. One of the most difficult parts of my digital journalism career, which began in 1996, has been deciding what to learn and, also, what not learn but delegate to a skilled colleague. I’m always up for learning new things, but there is a limit. Bottom line: It’s not easy. In the mid-90s, I had to know how to build websites by hand, but then automation and content management systems made most of those skills redundant. It was more important to know the possibilities, and limits, of HTML. When I worked for the BBC, I picked up a lot of multimedia skills including audio recording and editing, video recording and basic video editing, and even on-air skills. I also was able to experiment with multimedia digital story-telling. However, with the rise of blogs and social media, suddenly the focus was less on multimedia and more on interaction. All those skills come in handy, but the main lesson in digital media is that it’s a constant journey of education and re-invention.

What do I mean about choosing what not to learn? In the mid-90s, I was faced with a choice. I could have learned programming and become more technical, or I could focus on editorial and work with a coder. I did learn a bit of PERL to run basic scripts for a very early MySociety-esque project about legislators in the state where we worked, but after that, I handed most of the work off to a crack PERL developer on staff. I knew what I wanted to do, and he could do it in a quarter of the time.

I knew that my passion was telling stories in new ways online and, whilst I didn’t learn to programme, I did pick up some basic understanding of what was possible: Computers can filter text and data very effectively. They can automate repetitive tasks, and even back in the late 1990s, the web could present information, often complex sets of data, in exciting ways. I realised that it was more important for me to know the art of the possible rather than learn precisely how to do it. My mindset is open to learning and my skillset is constantly expanding, but to be effective, I have to make choices.

One thing that we’re sorely lacking as an industry are digitally-minded editors who understand how to fully exploit the possibilities created by the internet, mobile and new digital platforms. Print journalists know exactly what they want within the constraints of the printed page, which often in presentation terms is much more flexible than a web page. However, they bring that focus on presentation to digital projects. They think of presentation over functionality, largely because they don’t know what’s possible in digital terms. As more print editors move into integrated roles, they will have to learn these skills. They will eventually but, by and large, they’re not there yet. Note to newly minted Integrated editors: There are folks who have been doing digital for a long time now. The internet was created long before integration. We love to collaborate, but we do appreciate a little R-E-S-P-E-C-T.

In terms of learning the art of the possible, my former colleague at The Guardian, Simon Willison, has summed this up really well during a recent panel discussion:

I kind of think it’s the difference between geeks and the general population. It’s understanding when a problem is solvable. And it’s like the most important thing about computer literacy they should be teaching in schools isn’t how to use Microsoft Word and Excel. It is how to spot a problem that could be solved by a computer and then find someone who can solve it for you.

To translate that into journalism terms, it’s about knowing how to tell stories in audio, text, video and interactive visualisations. It’s about knowing when interactivity will add or distract from a story. It’s an understanding that not every story need to be told the same way. It’s about understanding that you have many more tools in your kit, but that’s it’s foolish to try to hammer a nail with a wrench. It’s not about building a team where everyone is a jack of all trades, but building a team that gives you the flexibility to exploit the full power of digital storytelling.

{ 0 comments }

Silly season’s here again

by Suw on August 19, 2010

Earlier in the week, Channel 4′s Samira Ahmed sent these messages to Twitter:

SamiraAhmedC4: MATHS HELP! Do I need to say “comma” if I read out this formula tonight: p(h,r)=u(h,r)-pr=g(h, Zr)+f1[h, m(o,r)]+f2[h, m(o,r)]+E-pr.
Aug 16, 2010 03:56 PM GMT

SamiraAhmedC4: It’s the formula to explain how Blackpool (like Bath before it) is becoming classier.
Aug 16, 2010 03:57 PM GMT

If you’ve spent any time at all watching the debunking of bad science coverage, you’ll be wincing, because that formula has all the signs of being total tosh. August is silly season, the time of year where PR companies know they can trot out any old rubbish and it’ll make headline news because nothing else is going on. It’s a tried and tested method.

Ben Goldacre spends quite a bit of time debunking not just silly season stories but also flaws in the media coverage of health and medicine stories that could have serious public health repercussions. It was entirely unsurprising that he should see Samira’s tweets and dismiss them out of hand, given the PR industry’s history of producing bunkum formulae to promote their own brands.

Ben said:

BenGoldacre: .@SamiraAhmedC4 no, you just have to say “by reading this out, i have lost all respect for myself as a journalist”

Ben is followed by a lot of people who hold similar viewpoints to his and a pile-on ensued, with quite a few people being unpleasant to Samira.

Update 13:06: Gordon Rae has found the original press release from Nottingham University Business School.

The story seems to have originated from the the PA, who’d done a very shoddy job in covering it:

Resort’s winning formula hailed
Academics have claimed new Premiership heroes Blackpool as living proof of a formula predicting the resurgence of the fading Lancashire resort.

The equation is based on how different social classes interact to make or break a holiday resort.

Nottingham University Business School used the rise, fall and renaissance of Bath since its 18th-century heyday as the original basis for the theory. But now they claim Blackpool’s return to top-flight football shows the formula applies.

“Academics have claimed” is a classic fudge which often really means “We got sent a press release and can’t be bothered to actually find out any more about the story so we’re just going to make it fuzzy round the edges and hope no one notices”. It’s no wonder that people thought it was nonsense. It had all the signs.

It turns out that the story is actually based on a published paper:

The rise, fall and renaissance of the resort: a simple economic model
Author: Swann, G.M. Peter
Source: Tourism Economics, Volume 16, Number 1, March 2010, pp. 45-62(18)
Publisher: IP Publishing Ltd

When he found out, Ben apologised both on Twitter and on his own Posterous.

BenGoldacre: .@samiraahmedc4 humblest apologies, all the outward signs of bullshit were there, and was impossible to tell from PA report. sorry!

Many of his followers who had been rude to Samira also apologised to her.

Now, normally, this little spat wouldn’t be worth blogging about. A disagreement between people on Twitter that resolves amicably is barely worth a second thought. It happens all the time.

But the idea that, after the friendly apology, it was all water under the bridge is a little undermined by Samira’s article in today’s Independent about it, which in my opinion not only sports a lot of unnecessary ad hom attacks, but also fails to draw the most important conclusions from this storm in a teacup.

The title, Samira Ahmed: Targeted by the ruthless Twittermob, sets a poor tone from the off. I’ve had a look through the Tweets and “ruthless Twittermob” it was not. Snarky, rude, inconsiderate and thoughtless group, yes. But ruthless mob?

Samira begins by explaining that she is new to Twitter and had got some advice from “old Twitter hands”:

1. Twitter works best as a two way networking tool – asking as much as telling. And 2. Scientists, and the writer Ben Goldacre in particular, can get a bit aggressive on it.

The first piece of advice is good. The second is both a sweeping generalisation in regards to scientists and an ad hom towards Ben.

I flagged this second sentence up on Twitter, and Samira told me that it had been added by the sub and that she was unhappy about it, so we’ll have to take the entire piece as an amalgam of Samira’s own writing and the Indy’s sub’s writing, as we have no way of telling them apart.

Update: Whist writing this, this sentence has been updated to: “2. The science writer Ben Goldacre can get a bit aggressive on it.”

But getting the first, now even sharper, ad hom against Ben in before the end of the first paragraph makes me wonder what the point of this piece is. If all is forgiven and everyone has apologised, why go to a national newspaper to drag everything over the coals again? Was this piece written to examine the phenomenon of herd-like behaviour online and the psychology that might explain it? Or to have a stab at Ben and by association, his newspaper, The Guardian?

The second para takes another swipe at Ben, about how he got “his science facts wrong and launch[ed] a personal attack on my journalistic integrity.” Ben commented before checking the facts, and then apologised when he realised the formula was real. He shouldn’t have jumped to conclusions like that, but I do feel Samira’s overstating her case a bit. Here are his tweets in order, so you can make up your own mind:

.@SamiraAhmedC4 no, you just have to say “by reading this out, i have lost all respect for myself as a journalist”     6:07 PM Aug 16th

any nerd bloggers who want to pre-mock C4 News, looks like theyre covering this bullshit http://bit.ly/cmBfBx http://bit.ly/98aJKC    6:12 PM Aug 16th

.@SamiraAhmedC4 i’ve written a lot on this kind of lame non-journalism, some of it in this category here: http://bit.ly/dBGWLX     Mon 16 Aug 17:17:53 2010

.@SamiraAhmedC4 youre the one in a position to judge, all i can see is an “equation” with no terms defined. put press release online for us?     Mon 16 Aug 17:40:22 2010

.@alexbellos @samiraahmedC4 haha no, wait, for the first time in media history, this is actually a real formula! http://bit.ly/9xolOM     Mon 16 Aug 17:46:54 2010

Reading on in the Indy article we find a yet another ad hom:

We all enjoy self-styled sheriffs like Goldacre roaming the web setting their posses on quack doctors. But journalists like me, who work for major news broadcasters, operate under a code of conduct broadly similar to our television content.

I’m not sure where to start on that one, other than if Ben’s comment was an attack on Samira’s journalistic integrity, that doesn’t make it ok for her to attack his.

The lessons Samira draws are that reputation is important, Twitter isn’t about broadcasting, and that it’s a good tool for finding new voices. That’s all fair. But she misses some other key lessons from her particular experience:

1. Understand the culture of the community you are entering
This is the first thing that I tell all my clients who want to use social media. It’s 20% tools, 80% people and if you don’t understand how people relate to each other in the context of the community you are trying to be a part of, you will make a mistake. Samira’s mistake was in not understanding that posting a formula and asking how to pronounce without providing either context or source could be misinterpreted.

The lesson from this should have been that if you ask for help with something scientific, provide a link to your source material first. That source material should be the academic paper if you have one or the press release if you don’t. If you’re writing a story based on a press release, be really, really careful what you say.

2. Twitter escalates the bad and the good, irrespective
Twitter does great things, spreading the word about important issues or worthy causes. But Twitter is made up of humans, and humans sometimes get things wrong. In those cases, bad words will spread just as far, just as fast. This is unfortunate, but it is pretty predictable.

The lesson here would be that when something goes bad, try to understand what happened and why, and then nip it in the bud as fast as possible. Samira failed to provide context and without context the formula looked like nonsense. Rather than asking Ben to DM or email, it might have been more effective for Samira to hunt down the original paper (which she should have had to hand anyway) and post the link to that on Twitter. Although Samira mentioned “Nott U biz school” on Twitter, it seems she didn’t link to the paper itself.

3. Everyone makes mistakes
On Twitter especially. Everyone, from @StephenFry on down, at some point Tweets something that they later regret. From public messages that should have been DMs to snarky comments that one later regrets, pretty much everyone says something daft on Twitter eventually.

This lesson’s easy. With great power comes great responsibility. Ben should understand that with 57,004 followers, he has great power. I understand his temptation to snark first and ask questions later, but some pre-snark research may well have changed his mind about what to Tweet and saved everyone some hassle.

Neither Ben nor Samira have covered themselves in glory here. And normally, I wouldn’t even bother covering this spat, if that had been all it was. But I get a little cross about these sorts of ‘Twittermob’ stories, because they remind me of the old school “The internet is full of axe-wielding murderers” stories that used to get published so often a decade ago (and still do by the red tops). That’s just wrong. They show a distinct lack of understanding of Twitter and social media in general and extrapolate too far from personal experience, emphasising the bad and generally ignoring that it’s well outweighed by the good. That has the potential to dissuade people from taking part in what can actually be a vibrant, supporting, intelligent, friendly place. And we’re all the poorer for that.

Update 22:40: Samira has emailed me to ask if I would post a full set of her own Tweets, which I am happy to do. I had to take the timestamps out from what Samira sent me as unfortunately they had gotten all mangled, but they are in reverse chronological order, and there are more after the jump.

@katebevan to quote my favourite fictional science officer the whole experience has been….. fascinating.

@BlakeCreedon Thankyou so much.

@Schroedinger99 Thankyou for the apology.

And all this before the story’s even aired. Latecomers start here: http://bit.ly/9EZtyU

[click to continue…]

{ 18 comments }