All functions

check_data_availability()

Check data availability for the individual source

check_data_irs_web()

Check data availability for IRS and website

check_for_grantmaking_activity_990()

Check for grant-making activity from 990 forms

clean_program_desc()

Clean the descriptions of the programs run by a particular organization

combine_texts()

Combine multiple webpage texts into one shared text with headers and footers removed

example_function_for_single_org()

Get various information about an organization

extract_about_links()

Extract links and other information related to the about page

extract_about_page_content()

Extract about page content

extract_financial_info()

Extract financial information from 990 forms

filter_null_grant_info()

Filter null grant information

find_about_link()

Find links and other information related to the about page

find_all()

Find Twitter handles using either the name (default strategy) or the website of an organization

find_facebook_page_from_org_page()

Parse Facebook handles from an organization website

find_footer()

Find a common footer string in a set of texts

find_header()

Find a common header string in a set of texts

find_possible_link()

Find a possible link based on a base URL

find_search_term()

Check whether the href attribute contains "search_term" or not

find_twitter_handle_from_bing()

Find Twitter handles using Bing API

find_twitter_handle_from_org_page()

Parse Twitter handles from an organization website

get_990()

Get the XML root element associated with a particular Employer Identification Number (EIN)

get_about_page_content()

Get about page content

get_all_financial_data()

Get all financial fields for a given org

get_all_texts()

Extract all about page texts for a given organization

get_aws_url()

Get the Amazon Web Services (AWS) URL associated with a particular Employer Identification Number (EIN)

get_aws_url_from_server()

Get the Amazon Web Services (AWS) URL associated with a particular Employer Identification Number (EIN)

get_contact_links_from_website()

Get child links associated with email contacts

get_email_contact_from_webpage()

Get contact webpage content

get_emails_from_website()

Get email addresses from a base URL

get_filing_type_990()

Get 990 filing type

get_financial_details_990()

Get financial details from 990 forms

get_financial_details_990ez()

Get financial details from 990 EZ forms

get_foundation_ein()

Get Employer Identification Numbers (EINs) associated with foundations

get_grantmaking_details_990()

Get grant-making details from 990 forms

get_links()

Get links based on a base URL

get_organization_name_990()

Get the name of the organization associated with a particular Employer Identification Number (EIN)

get_scheduleO()

Get concrete information from Schedule O

get_scheduleR()

Get concrete information from Schedule R

get_single_value_990()

Get concrete information from 990 forms

get_value_990()

Get concrete information from 990 forms

if_not_about()

Check whether the href attribute contains "about" or not

if_not_search_term()

Check whether the href attribute contains a user-defined "search_term"

ifnotNA()

Simplify ifelse process

import_idx()

Import machine-readable data from 990 forms filed with the IRS

irs_fields

Data fields available from IRS 990 forms

irs_index

Index table of all available IRS filings

parse_by_length()

Extract significant text from HTML source

parse_facebook_page_from_page()

Parse Facebook handles from a webpage

parse_twitter_handle_from_page()

Parse Twitter handles from a webpage

search_all_activities()

Search all activities

standardize_990_flag()

Standardize 990 flag

standardize_url()

Standardize the website URL of an organization

url_exists()

Check if a URL exists with httr