view the rest of the comments
datahoarder
Who are we?
We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.
We are one. We are legion. And we're trying really hard not to forget.
-- 5-4-3-2-1-bang from this thread
I don’t currently have adequate understanding of different SEC FORMS, and although I can use python I have no experience writing web crawlers at the moment.
I dont have any experience with understanding SEC forms either. Is there a repository for SEC forms? Or do you imagine looking at all companies website to mine for those forms?
SEC has the Edgar database where you can lookup any company and access there different SEC forms, but you still need to know which forms to look for the information in. For example, the 10k of one company had the ownership informing of top shareholders, but I wasn’t able to find that info in the 10k of another company (possibly because I didn’t know where to look). I know you can use EDGAR database to at least lookup these forms, but I do not know the full capabilities of the database (such as if you can query for ownership directly) because I just discovered it the other day.
It looks like EDGAR has a rest API and a full text search as well