Changes

Jump to navigation Jump to search
no edit summary
Line 1: Line 1: −
== Reactions to other talks ==
+
= Overall strategy =
=== Observation on OpenWPM ===
+
== Philosophy ==  
OpenWPM has split up into a testbed for mass crawling, and a web extension which is separate (currently used in a second project, for mass reporting to Mozilla).
+
The right of access is a tool to provide accountability for profiling, beyond challenging consent or legal basis. It is actually complementary.  
* [https://twitter.com/CartoCharles/status/1130367901767262209 Think OpenStreetMap, not Google Street View or Tesla]
     −
== Interesting queries ==
+
The right of access allows to probe the entire ecosystem, strategically. Using this strategy requires to blend knowledge of law and technology.
* [{{SPARQLEmbed|query=
  −
#defaultView:Graph
  −
SELECT ?item ?itemLabel ?_image WHERE {
  −
  ?item pdiot:P28 pdio:Q370.
  −
  SERVICE wikibase:label {
  −
    bd:serviceParam wikibase:language "en" .
  −
  }
  −
OPTIONAL { ?item pdiot:P47 ?_image. }
  −
}
  −
LIMIT 100
  −
}} adtech mess]
  −
* [https://query.personaldata.io/index.html#PREFIX%20pdio%3A%20%3Chttp%3A%2F%2Fwiki.personaldata.io%2Fentity%2F%3E%0APREFIX%20pdiot%3A%20%3Chttp%3A%2F%2Fwiki.personaldata.io%2Fprop%2Fdirect%2F%3E%0A%0ASELECT%20%3Fitem%20%3Flabel%20%3F_image%20WHERE%20%7B%0A%20%20%3Fitem%20pdiot%3AP3%20pdio:Q33.%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%0A%20%20%20%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%22%20.%20%0A%20%20%20%20%3Fitem%20rdfs%3Alabel%20%3Flabel%0A%20%20%7D%0A%20%20%0AOPTIONAL%20%7B%20%3Fitem%20pdiot%3AP47%20%3F_image.%20%7D%0A%7D%0ALIMIT%20100 datasets about the adtech ecosystem]
      +
== Strategy ==
 
* [https://lumapartners.com/wp-content/uploads/2017/01/2y103ITnIXmg2.lPu6ncZsis.wcn8DVuYIUfyLrC9cdQiWUhfOdDLq6-1024x768.png "historic" Lumascape picture]
 
* [https://lumapartners.com/wp-content/uploads/2017/01/2y103ITnIXmg2.lPu6ncZsis.wcn8DVuYIUfyLrC9cdQiWUhfOdDLq6-1024x768.png "historic" Lumascape picture]
* [http://query.personaldata.io/embed.html#PREFIX%20pdio%3A%20%3Chttp%3A%2F%2Fwiki.personaldata.io%2Fentity%2F%3E%0APREFIX%20pdiot%3A%20%3Chttp%3A%2F%2Fwiki.personaldata.io%2Fprop%2Fdirect%2F%3E%0APREFIX%20pdiop%3A%20%3Chttp%3A%2F%2Fwiki.personaldata.io%2Fprop%2F%3E%0APREFIX%20pdiops%3A%20%3Chttp%3A%2F%2Fwiki.personaldata.io%2Fprop%2Fstatement%2F%3E%0APREFIX%20pdiopq%3A%20%3Chttp%3A%2F%2Fwiki.personaldata.io%2Fprop%2Fqualifier%2F%3E%0A%23defaultView%3AGraph%0ASELECT%20%20%3Fobject%20%3FobjectLabel%20%3Frgb%20%3Fsubject%20%3FsubjectLabel%20%0AWHERE%20%7B%0A%20%20VALUES%20%28%3Fobject%20%3Frgb%29%7B%0A%20%20%20%20%20%20%20%28pdio%3AQ110%20%22FF0000%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ495%20%2200FF00%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ559%20%220000FF%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ153%20%22333333%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ491%20%22888800%22%29%20%20%0A%20%20%20%20%20%20%20%28pdio%3AQ807%20%22008888%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ492%20%22880088%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ498%20%22440000%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ810%20%22004400%22%29%0A%20%20%20%20%20%20%20%28pdio%3AQ556%20%22000044%22%29%0A%20%20%20%20%7D%0A%20%20VALUES%20%3Fpredicate%20%7B%0A%20%20%20%20%20%20%20pdiot%3AP3%0A%20%20%20%20%20%20%20pdiot%3AP28%0A%20%20%20%20%7D%20%0A%20%20%3Fsubject%20%3Fpredicate%20%3Fobject%0A%20%20SERVICE%20wikibase%3Alabel%20%7B%0A%20%20%20%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%22%20.%20%0A%20%20%7D%0A%7D%0ALIMIT%2010000 Lumascape]
      +
* Ignore the average end user. Focus on investigator, who is a blend of:
 +
** legal scholar;
 +
** tech auditor;
 +
** strategic litigation NGO;
 +
** educator;
 +
** data protection authority confidante;
 +
** journalist (for amplification).
 +
 +
It is a technical and human, but secondary, problem to break the one-brain barrier within civil society -- does require to overcome traditional barriers, like attribution (but Open Science strategies can mitigate).
 +
 +
* Build up reliable facts separately in the technical and legal domains.
 +
* Work (hard) to reconcile the two, and refine the reconciliation by iteration.
 +
 +
== Key factors ==
 +
Increase utility of information obtained through SARs in order to further the goals of people here.
 +
 +
* Knowing flows of data from a technological standpoint (but: limited auditing capacity, we are partly blind);
 +
* Knowing purposes for each data processing operation, including transfers;
 +
* Knowing the legal positioning of the services, including over each data transfer (data controller vs data processor);
 +
* Knowing to whom the user is identifiable, and how (including if more information is provided).
 +
 +
== Tactics ==
 +
* Classify services, based on business presentations. Enter at each step of the process through access requests.
 +
* Same service can exist at all scales, attack all of them
 +
* Filing a SAR is cheap, but takes time. Make it easier (schema.org lobbying, for instance --> Art 40.2.f).
 +
* Carpet bombing is A-OK, at high frequency (reducing time lag for responses).
 +
* Makes it easy to comply with access requests (standards, such as openGDPR)
 +
 +
= The landscape =
 +
== Datasets about the adtech ecosystem ==
 +
* [[WebXray domain ownership list]]
 +
* [[Related datasets]]
 +
 +
= Visualizations =
 +
{{Project:Adtech/Visualizations|state=collapsed}}
 +
 +
= A prototype =
 +
* {{Q|102}}
 +
* [[Addressing_adtech/Guardian|An example request]], generated directly from {{Q|855}}.
 +
 +
= Ontology =
 +
See [[Project:Ontology/Adtech]]
 +
 +
= Precise questions =
 
== {{Q|845}}-related questions ==
 
== {{Q|845}}-related questions ==
 
* What is the impact of a {{Q|839}} on a {{AllQ|495}}?
 
* What is the impact of a {{Q|839}} on a {{AllQ|495}}?
Line 32: Line 63:  
*** {{Q|546}} for instance facilitated the selling of an {{Q|496}} by {{Q|840}} of people more "likely to be constipated". This was determined (?) through the use of {{Q|539}}.  
 
*** {{Q|546}} for instance facilitated the selling of an {{Q|496}} by {{Q|840}} of people more "likely to be constipated". This was determined (?) through the use of {{Q|539}}.  
   −
* [[Addressing_adtech/Guardian|An example request]], generated directly from {{Q|855}}.
      
=== Tag management ===
 
=== Tag management ===
Line 66: Line 96:     
== Adtech experiments ==
 
== Adtech experiments ==
* {{Q|188}} helps get feedback quicker
+
* {{Q|188}} and subclasses help get feedback quicker
 
* {{Q|858}} defines a {{Q|94}}
 
* {{Q|858}} defines a {{Q|94}}
 
** There are problems with the [https://vendorlist.consensu.org/vendorlist.json "purposes" defined in the consent framework].
 
** There are problems with the [https://vendorlist.consensu.org/vendorlist.json "purposes" defined in the consent framework].
 
* Through cookie manipulations you can consent very flexibly. A/B test on the entire IAB, agree to A on odd day, B on even day.
 
* Through cookie manipulations you can consent very flexibly. A/B test on the entire IAB, agree to A on odd day, B on even day.
  −
== Facebook's Replacement IDs ==
      
== Consents ==  
 
== Consents ==  
Line 81: Line 109:  
Involves Europeans, but also Americans, whose data goes abroad. Drives a wedge between Facebook Ireland and Facebook Inc, etc.
 
Involves Europeans, but also Americans, whose data goes abroad. Drives a wedge between Facebook Ireland and Facebook Inc, etc.
   −
== What next? ==  
+
== Facebook's Replacement IDs ==
 +
Crucial in the context of WhatsApp/Messenger/Instagram merger.
   −
=== Links to competition ===
+
Facebook implements the entire ecosystem into one product. This is a challenge for them in blurring between pseudonymization and anonymization. They implement all of this into one big association table. See {{Q|860}}, page 46 of 57.
Privacy is a dead-end for enforcement. It will always boil down to consent and design, and always be abused in unaccountable ways, in order to get the first users. And then it will expand progressively to everyone (cf. Facebook's experiments to get my consent in order to use facial recognition).
     −
On the other hand, tracing data flows -- which is possible thanks to data protection law -- helps define exactly the assets that are being shared. It's not just about raw (personal) data, it's also about the consents associated to this data, as well as its identifiability. Once all this is taken into account, we will have a better understanding of the personal data market, and will be better able to assess dominance of some players over that market. This might open the door to a more reasoned antitrust action.
+
= What next? =
 +
 
 +
== Question on what is observable in RTB ==
 +
There is a bias in what researchers study. What is it?
    +
== Links to competition ==
 +
Privacy is a dead-end for enforcement. It will always boil down to consent and design, and always be abused in unaccountable ways, in order to get the first users to adopt new intrusive technologies. And then it will expand progressively to everyone (cf. Facebook's experiments to get my consent in order to use facial recognition).
    +
On the other hand, tracing data flows -- which is possible thanks to data protection law -- helps define exactly the assets that are being shared. It's not just about raw (personal) data, it's also about the consents associated to this data, as well as its identifiability. Once all this is taken into account, we will have a better understanding of the personal data market, and will be better able to assess dominance of some players over that market. This might open the door to a more reasoned antitrust action.
   −
=== Observations on right of access ===
+
== Observations on right of access ==
    
  GDPR Art. 15 (Right of Access)
 
  GDPR Art. 15 (Right of Access)
Line 107: Line 141:       −
=== Schema.org suggestion ===
+
== Schema.org suggestion ==
 
Someone should suggest to schema.org to add GDPR related terms (like "data controller", "access request email endpoint", etc).
 
Someone should suggest to schema.org to add GDPR related terms (like "data controller", "access request email endpoint", etc).
 +
 +
== Observation on OpenWPM ==
 +
OpenWPM has split up into a testbed for mass crawling, and a web extension which is separate (currently used in a second project, for mass reporting to Mozilla).
 +
* [https://twitter.com/CartoCharles/status/1130367901767262209 Think OpenStreetMap, not Google Street View or Tesla]
 +
* --> Build toolkits/diffuse computing?!
    
== Relevant items ==
 
== Relevant items ==
Line 120: Line 159:  
* {{Q|559}}
 
* {{Q|559}}
   −
== Interesting queries ==
+
== See also ==
* [{{SPARQLEmbed|query=
+
* [[Project:GetYourData]]
SELECT ?item ?label ?_image WHERE {
  −
  ?item pdiot:P3 pdio:Q110.
  −
  SERVICE wikibase:label {
  −
    bd:serviceParam wikibase:language "en" .
  −
    ?item rdfs:label ?label
  −
  }
  −
OPTIONAL { ?item pdiot:P47 ?_image. }
  −
}
  −
LIMIT 100}} adtech companies]
  −
* [{{SPARQLEmbed|query=
  −
SELECT ?item ?label ?_image WHERE {
  −
  ?item pdiot:P3 pdio:Q495.
  −
  SERVICE wikibase:label {
  −
    bd:serviceParam wikibase:language "en" .
  −
    ?item rdfs:label ?label
  −
  }
  −
OPTIONAL { ?item pdiot:P47 ?_image. }
  −
}
  −
LIMIT 100}} data management platforms]
  −
* [{{SPARQLEmbed|query=
  −
SELECT ?item ?label ?_image WHERE {
  −
  ?item pdiot:P3 pdio:Q559.
  −
  SERVICE wikibase:label {
  −
    bd:serviceParam wikibase:language "en" .
  −
    ?item rdfs:label ?label
  −
  }
  −
OPTIONAL { ?item pdiot:P47 ?_image. }
  −
}
  −
LIMIT 100}} identity resolution services]
  −
*[{{SPARQLEmbed|query={{SPARQLPredicateGraph
  −
|filter_values_colors=(pdio:Q110 "EEEEEE")(pdio:Q495 "222288")(pdio:Q559 "8822222")
  −
|predicate=pdiot:P3
  −
|predicates_subjects_objects_colors=(pdiot:P111 pdio:Q495 pdio:Q559 "EEEEEE")(pdiot:P111 pdio:Q495 pdio:Q559 "222288")
  −
|legend=true
  −
}}}} actors]
 

Navigation menu