Web Scraping Legal Issues

From OpenCongress Wiki

(Difference between revisions)
Jump to: navigation, search

Line 8: Line 8:
  
 
===Barriers to Access===
 
===Barriers to Access===
  (with national examples, and news coverage)
+
  with national examples, and news coverage
 +
sign-in systems (see PACER)
 +
tresspassing
 +
terms of use
  
 
===Barriers to Reuse===
 
===Barriers to Reuse===
 +
privacy laws
 +
national security laws
 +
IP laws / copyright
  
 
===Public Interest Scraping Justifications===
 
===Public Interest Scraping Justifications===

Revision as of 15:36, November 18, 2012


This page is part of the Transparency Hub project.
Add what you know.

Contents

Introduction

The public interest case for web scraping is well understood among technologists and public advocates, but often poorly understood by everyone else. A developer setting up a scraper faces an often uncertain legal context, as laws and precedents can vary different countries, and the legal issues surrounding web scraping have been only vaguely formalized or decided by courts. This page seeks to gather relevant resources about the legal concerns surrounding scraping.

Scraping and its legal context should also be normalized a bit, since legal advice often doesn't take into account public interest motivations behind copying public sector information, and treats scraping as a sort of childish malevolence, even as huge businesses are more quietly built around web scraping. We would all be better off if smaller public interest actors had that same confidence. While this page isn't intended as legal advice, hopefully it can be a helpful first step.

Barriers to Access

with national examples, and news coverage

sign-in systems (see PACER) tresspassing terms of use

Barriers to Reuse

privacy laws national security laws IP laws / copyright

Public Interest Scraping Justifications

Other Relevant Resources

http://blog.scraperwiki.com/2012/04/02/is-scraping-legal/ https://scraperwiki.com/docs/python/faq/#scraping_legality

Toolbox

OpenCongress is a joint project of the Participatory Politics Foundation and the Sunlight Foundation. Questions? Comments? Contact Us