Project:Data anthropology and ethics

From Wikibase Personal data
Revision as of 06:52, 13 May 2020 by Genferei (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search
Social contouring.png

Thema:Social contours
Project leader: Pidoux


A non-profit project based on a sociological study, from questionnaires and tested with computational models
A methodology for adapting to local models
Social responses for communities
Open Source: models and methodology
Different from a digital contact tracing proposal
Digital contact tracing can be discussed with Paul-Olivier Dehaye also part of our team
Obtaining public funding from public institutions in different countries (Switzerland, Canada, Netherlands)
Hiring full time people
The members of this team are "consultant experts" to implement the project. We provide the questionnaire, scientific article, consulting and follow up

TogetherVsVirus 17-19 April 2020

Central document for the hackhathon

Slack invitation to Togethervsvirus:
our team channel:

Zoom meetings
Meeting ID: 920-5292-2031
Meeting password: 3zJQ97

Meetings Minutes

Public Questionnaire

SURVEY APP based questionnaire : ENGLISH

English questionnaire public link:
English questionnaire in a text document last master version official to be used for translations:
Admin access to the english questionnaire in google form:
FRENCH questionnaire (OLD Version) admin access:

Final presentation

Resources to visualize data:

Expert Questionnaire
Risk assessment questionnaire public link, to share only with epidemiologists or related fields:
Admin access to the risk assessment questionaire, to update if new questions or answers are added to the master questionnaire:

Here are the questionnaire results as we used them in the Swiss hackathon (in ./data/), the analysis script (./ and the output (in ./out/): feel free to peek and poke.

covid19 civil society /Staatslabor

Project:Data anthropology and ethics/CivilSociety

Next meeting

April 09, 2020 at 6pm for reviewing the questionnaire.


The initial challenge submitted to versus virus can be found here:


available here : Project:Data_anthropology_and_ethics/Questionnaire

Project framework


Minutes 2020/04/08

  • Use the methodology The Role of Age Distribution and Family Structure on COVID-19 Dynamics: A Preliminary Modeling Assessment for Hubei and Lombardy .

Paper + data --> Item:Q4393 data are avaialble --> file:World_Age_2019.csv
Korean data will be used to make extraprolation and valide the model.
Social contours: make assumption , then mapping on how the population looks like. .
Homophily: tendency of individuals to associate and bond with similar other

Versus Virus challenge No. 46

date: 3 april -5 April 2020

In versus virus our challenge is No. 46. Our initial challenge in the link above was merged with other submitted challenges with other purposes, we were mainly working with another team developing the prototype for the app below.


Ideas to work on:
1. NLP tool according to Paul-Olivier post on slack
2. Data analysis of questionnaires results
3. Modelling lifestyles from personas and questionnaires
participants: Jessica
4. Identifying factors of risk according to lifestyles and virus characteristics

HackVid 28.03

1. Started by public health surveillance, and their ethics and laws. Rules of privacy already allow the state to limit the freedom. Enhanced with digital systems, like asking for digital surveillance. What are limits, etc.

2. Starts from the bottom up, difficult to conceptualize. Not part of standard public health surveillance. If imagine new ecosystem of contributors, not to enhance surveillance. Should they need a framework for themselves beyond the laws. What's is the driver?
what's are the limits?
Should we need a framework?

  • Start with anthropology

whats' the model
how many models?
identify how people are living? How exposed they're?
social points of contagion
what's the implication for ethics?

700 answers from the French version of Questionnaire COVID-19 lifestyles and contagion

Very interesting input specially the supermarket visit.

11 types of at risk profiles were detected

  • kid + single parents
  • teenager + student
  • single person without children with office job (homeworking)
  • doctor in hospital (borderer worker)
  • nurse in mobility
  • delivery person
  • cashier or seller in a supermarket + risky partner
  • independent worker
  • unemployed person healed from COVID-19
  • isolated elder person
  • family with home-school

  • From the data perspective

summary of the discussion:
1) personas can be helpful as elements in a methodology to analyze the spread of a disease and the risk factors
2) data collaboratives can be used to collect the data needed to build and refine such models
3) at the same time, these data and models generate risks of stigmatization
4) personas can also be helpful as elements in a methodology to analyze the spread of stigmatization risks
5) disease risks and stigmatizations risks are structurally related, as groups at high risk of contagion face higher stigmatization risk, irrespective of other (positive/negative) characteristics
6) additionally, other characteristics (e.g. negatively perceived social traits, e.g. poverty, social deviance) combine with at risk behaviour to generate higher stigmatization risks

1. Categorization of the personas for modeling the drivers of risks of contagion and being stigmatized.
2. Three levels of analysis to the modeling:

  • risks of contagion according to the virus and COVID-19 characteristics
  • lifestyles factors that would increase the risks of contagion
  • ethical issues : stigmatization

3. Use data for public health purpose only

Which data are relevant from the personas for design)
Possible sources of data: potentially portable COVID testing.
An ultrasensitive, rapid, and portable coronavirus SARS-CoV-2 sequence detection method based on CRISPR-Cas12

[feature 1: degree of risk for themselves, likelihood to get infected] [how to know?]
-> result of a model with a lot of variables

[feature 2: Based on the configuration of the shelter, what are the degree of risk of contaminating others, likelihood to infect others] [how to know?]
variables: shelter configuration, occupation,

[feature 3: typical contagion channels] [how to know?]
variables: transportation, shared facilities, occupation, leisure, food supply, hospital, pharmacy,

[feature 4: easy to detect/contact/trace] [how to know?]
variables: level of education, gender, age, access to tech, awareness, location (urban/suburban/remote), level of trust in government & health system,

[feature 5: risk of harming by detection/contact/tracing ] [how to know?]
variables two families of stigmatization risks: A) socially undesired behaviour. e.g., illegal activities, -> have incentive to avoid surveillance -> risk is not intrinsically related to the generation of useful knowledge from an epidemiological point of view -> surveillance can try to mitigate the problem by getting as much information as possible about the disease-related aspects of behaviour, ignoring or masking the non-disease-related aspects of the behavior

B) all groups combining a high degree of risk of contagion with a high risk of contaminating others, based on demographic, lifestyle, occupational characteristics -> stigmatization risk is intrinsic to (and highly correlated with) the generation of useful knowledge from an epidemiological point of view -> surveillance can try to mitigate the problem by a) intervening on the granularity of the description of risk-related phenomena (may be in trade-off with data utility), b) adopting a language that does not revolve around concrete sociological categories keywords: stigmatization, discrimination, group exclusion.

As example, 2 related controversies that have already happened in France
- stigmatization of people living in poor suburb of Paris associating hospital saturation with assuming not respecting the quarantine (press/social media comments)
- stigmatization of health workers (neighbour leaving note on nurses' flat door asking to go live elsewhere)

twitter exchange
about this article

[feature 6: communication risks, e.g. miscommunication, fear, stigmatization] [how to assess[h] this?]
variables: age, level of education,

Challenge for other hackathons

1. identify the relationships that explain and/or predict specific personas becoming drivers and spreaders of a disease
2. identify the relationships that explain and/or predict specific personas becoming stigmatized as drivers and spreaders of a disease, and the risk factors that would lead such stigmatization having heavy consequences on their well-being
3. build an early warning system for new social categories at risk of being identified as super spreaders and suffer the highest social consequences of this association
4-How to model the factors in lifestyle that would increase the risk of spreading/contagion, and identify the virus risks associated to this propagation