Skip to content

Scripts for downloading data from lists of mutual aid groups

License

Notifications You must be signed in to change notification settings

factn/mutualaid_meta_downloader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Meta Data Downloader

Quick and dirty scripts to merge the common-fields of Mutual Aid like listings from a number of sources into one big list.

This is a ResilienceApp internal tool, for analysis and review purposes only. It is not intended to be a public replacement for the awesome work done at the following great sites, which are the true sources of this data and which continue to hold all relevant copyrights - not to mention much more detailed and up to date data.

Goals:

Gathering as many mutual aid groups as possible, and merging/deduplicating them into an existing list. We have a few scripts already. This can be valuable even for research into what makes some mutual aid groups succeed while others fail. We do not want to link people to groups. Others are already doing that work. Our list doesn't need to be very detailed, just the essentials (meta-list).

Slack Channels:

#collab-reach4help-mutualaidworld #resilience-outreach

Problems:

  • We need a natural language type parser to automate what kind of type of information each site has:

  • How do we separate the bigger groups from the rest?

  • How do we separate mutual aid groups from food banks/information sites, etc?

  • Each site has a different data scheme.

Sources to scrape:

Scraped so far:

Installation

Install node 10.xx or above.

git clone git@github.com:factn/mutualaid_meta_downloader.git
cd mutualaid_meta_downloader
npm i

Usage

  1. Update any manual edits

Update manual_classifications.csv from AirTable

  1. Download data
node download_mutual_aid_hub.js > mutualaid_hub.csv
node download_mutualaidwiki.js >  mutualaid_wiki.csv
node parse_na_mutual_aid_networks_datasource.js > na_mutual_aid_networks.csv
node parse_reach4help_markers.js > reach4help.csv
  1. Run merge
node dedupe_and_merge.js > all_data.csv 

Data

Last time the script was run it produced..

all_data.csv

About

Scripts for downloading data from lists of mutual aid groups

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published