Hey @sourabh thanks for that info. I hadn’t seen that Glassdoor page about the free API being closed. That’s a shame. Be interesting to see what they come back.
Google Search Console
Right, got you. It was actually the Google Search Console I used. I picked up that tip from one of the Webinar video recordings over on Youtube. That was possibly from before the integration with LinkedIn (as the webinar mentioned circumventing going through LinkedIn’s API).
I’ve been able to replicate your configuration and skip through the results (per the tip mentioned in that webinar at this specific point). It’s one of those things where for this use case, it sort of works with a handful of records but it’s very flaky I’m feeding in variables along with the company name to make sure I get the right company e.g. Zoom Video (in a half baked attempt to avoid the wrong company, say Zoom Info). Marginally better than looking them up manually.
I managed to pull through “CEO approval rating” as a snippet but couldn’t find a formula to extract the % value only from the snippet (Is there one?).
Similarly, I needed to implement a results Id reference in another cell to be able to pull through the most appropriate snippet i.e. from the right company and something dated recently. Using the console route is very much an art as you say!!
Overall, my approach here is not very scalable. Even if I got it working, with search results moving around, it breaks and there’s a high risk of parsing the wrong company record and/or something that’s out of date on the snippet front.
Glassdoor also has reviews for some companies split by office GEO. But from what I can tell they’re still grouped under an overall employer record. It’s hard to tell when going via the Google search console unless you’re in the browser. I expect the API makes it easy to retrieve the overall one. But some companies seem to deliberately split their Glassdoor ratings (international level it seems). E.g my former employer has .com Glassdoor score, it’s US-centric (and for clarity I mean the record is listed as a separate employer), whereas the .co.uk is UK-centric (albeit I’m sure it’s a mix of the two GEOs (given job listings are for both countries). If I get API access, may just have to ignore that unless I can solve the associated domains/domain extension issue and group them under one record.
Overall, I’d be better with the API. At least I should be able to get the overall rating for a given domain. But the source data will have other nuances and quirks. Age-old data problems!.
Anyway, the Google Search Console route was worth a shot. Thanks for your help there.
Associated domains / Domain extensions
I was going to ask in my first post whether finding extensions was doable. I only spotted the Intercom example because in another test query with the Trustpilot integration it only returned 2 ratings based on the .com. I knew they had the .io and sure enough, it returned something more plausible rating count wise.
You can probably guess where I’m going with this, but the domain (domain path? is that the right term) is my key identifier for a company. In an effort to close some gaps, I was looking for an automated way of finding associated domains that I could then build up from. But, you’ve answered the question.
I’m surprised none of the company data aggregators do this in an effort to clean their data. There are definitely a lot of duplicate records (on Dealroom etc.) but I guess it’s hard to distinguish whether it’s the same “company”, a subsidiary or something else (without human intervention and the company themselves feeding into that).
Duplicate records aside, it’s a pity none of them lists alternative domains. A possible workaround that I’m thinking of that would probably cover 90% of cases is hooking into multiple APIs based on a parameter other than website domain (Twitter account URL, Linkedin URL) and then retrieving the website domains and doing an automated cross-reference. And then go from there.
Case in point Crunchbase lists Intercom’s .com domain while Dealroom lists the .io but they share other parameters in common.
Company Domain Search
Somewhat related to that (didn’t seem like a separate thread was worthwhile albeit this response is long!!). For company domain search, how do I know I’ve got the right one? Is it a case of manually checking, checking against Crunchbase?
Is there a function where it can return more than one response? Take Clubhouse as an example, there’s clubhouse .io (the project management software) and joinclubhouse .com (the Clubhouse suffering a bit of a ). Similarly, there’s Bolt, one is the online checkout company (.com), the other is mobility (.eu) …and sure there might be more Bolts. If there isn’t such a function to retrieve more than one response, how does the Find Domain function choose which one to retrieve? Does it pick the first one based on Google ranking for the company name?
Haha, I think I’m really wishing someone would have created a company equivalent of the ISBNs they use for books, cause companies rebrand too and change domains like dashdash to Rows
EDIT: I can only post 2 links as a new user so had to leave some spaces in some domain refs above.