Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Standardize to canonical company name #19

Open
ydoc5212 opened this issue Jul 30, 2021 · 3 comments
Open

Standardize to canonical company name #19

ydoc5212 opened this issue Jul 30, 2021 · 3 comments
Assignees
Labels
enhancement New feature or request standardization Mapping fields to canonical names, standardizing values such as dates, and other value add

Comments

@ydoc5212
Copy link
Member

Using OpenCorporates API? (https://api.opencorporates.com/)

@ydoc5212 ydoc5212 added enhancement New feature or request standardization Mapping fields to canonical names, standardizing values such as dates, and other value add labels Jul 30, 2021
@ydoc5212
Copy link
Member Author

ydoc5212 commented Aug 2, 2021

Note for CT company name pre-processing:
-use regex filter to remove parentheticals and asterisks marking updated revisions {eg Sodexo (Updated Notice)*}

@ydoc5212
Copy link
Member Author

ydoc5212 commented Aug 16, 2021

- [ ] see if opencorporates have python API client

- [ ] send email, CC serdar, asking for api key

  • submit public benefit application for api key
  • receive api key

@ydoc5212 ydoc5212 self-assigned this Aug 18, 2021
@Ash1R
Copy link

Ash1R commented Jan 23, 2023

How would we use this API for this task? Is it to remove things like "Updated Notice" by finding the company name in a string, or would we use CorporateGroupings to find constituent companies, or something else?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request standardization Mapping fields to canonical names, standardizing values such as dates, and other value add
Projects
None yet
Development

No branches or pull requests

2 participants