Over the past few months I have been building a data pipeline that extracts quotes and assigns them to company executives, using natural language processing (spacy).
I am focusing on the tech industry right now which I know pretty well (I used to work at CB Insights). Every Wednesday I put together a newsletter featuring interesting quotes and explain why they matter.
Eventually, Oversaid will be a platform that lets you search quotes yourself and come up with your own analysis, but I wanted to launch something early while I build out the product so that I can learn from prospective customers.
I'm having a lot of fun writing these up and I hope you enjoy reading them!
Hey, this looks kind of nice actually. But why hide the content behind an email sign-up, why not also provide an RSS feed, or at least the latest 1-3 send-outs? If you hadn't provided that link to the latest e-mail I would've brushed it off entirely and not given it a moments more thought. But you did provide that link, and I found the content relevant and interesting, and even dove deeper into a couple of the articles.
The content in this one e-mail is pretty good, but before I sign up I'd like to see more. But also it's not clear to me that I'm your customer, because reading your post here it sounds like your customer isn't the consumer of these feeds but actually the producers? I'm not one of them, so why do you care about my e-mail address, other than to sell to your actual customers down the line?
The main reason I'm starting this newsletter is to start conversations with people about this data, which is why I want people's emails.
Right now I'm hoping people are enticed enough to put their email address to check it out. I might be wrong about that and I might add a link to the latest email on the home page in the future to see if signups improved.
I have no intention of selling those emails. My goal is to start a Saas platform based on this data. A very small number of subscribers will hopefully become clients. The rest can just enjoy the newsletter and help me get the word out if they feel like it.
from what the OP says, it seems that this would eventually be a paid service (with a free newsletter) for people (maybe journalists) to easily find quotes/tidbits so that they can... well, do whatever they'd like with that information.
The sample letter refers to the press. Executives tend to be general with the press. Investors make them talk specifics. That happens during quarterly earning calls. Which would be a good source. Seeking Alpha has the transcripts.
The SEC filings are another liable source. Though not personal quotes, they include perspectives on tech trends.
1) I am using the press as a kind of filter of what's interesting (hypothesis: they pick the most interesting thing executives are saying on earning calls)
2) I am working on extracting other sources like earning transcripts and twitter, but this will be tougher for me to go through due to the sheer amount of things that are said in those sources
SEC is a good one too, the management analysis section of filings is always interesting so I could extract that.
I must say I found the "irrelevant commentary" rather entertaining, and I think it really enhances the value of the content. it's just the right amount to be fun, yet not overwhelming, keep at it!
Thanks! Yea the backend is all python (using spacy.io for NLP), and I built an admin in Rails to search through the quotes and correct the quotes when needed. Everything is stored in postgres.
I initially thought I would be able to manually review each quote (which was the main reason I built the admin), but it quickly became obvious that it was just too many quotes so I had to improve quote detection and attribution.
Big thanks to whomever wrote the spacy doc, I knew nothing about NLP (and I still know next to nothing tbh), but I was able to start using it pretty quickly.
Sorry in advance for the rather harsh feedback, that follows. It’s basically my unfiltered impression when I visited your site before reading your introduction comment here.
This is one of the occasions, when I love the German “every website needs a ‘Who’s responsible for this?’ page” law (a.k.a. “Impressumspflicht”). Why should I trust your specific selection of quotes and their interpretation? You even don’t trust your visitors with a “Who are we?” section. For all I know, this could be a Chinese or Russian troll factory outlet sale.
For this to work (for me, at least) you need to work _way_ more on the site’s transparency than a more or less default privacy disclaimer and an e-mail input form: Who am I, what criteria and sources are used for the quotes, how are they categorized, what do I do to prevent bias... The technology may as well be sound and state-of-the-art, but if I don’t trust the website, I won’t sign up to anything.
I don't know if it's a transparency issue rather than an editorial perspective. I don't really know or care who gave me my CNN/Fox/BBC/ITV news, but I care that it has a consistent and trustworthy editorial perspective, that may be carried over multiple people.
I guess transparency and a greater overarching perspective both work.
Love this idea. And congrats Alex - CB Insights Mafia!
Things that I think folks (aka me) would find interesting:
1. Impressions/views on tech markets (what they're entering, their views on growth of markets, etc)
2. Views on competition esp if they talk isht
3. Quotes with data. Because some of these products by tech cos are opaque with stats, if execs drop a figure about growth, that's valuable.
4. Over time, it'd be cool if you could see what products execs talk about in their public comments as that might give an indication of what they're focused on. Suspect they might not talk a lot about specific products so perhaps wishful thinking.
IMO, personally, I think quotes on politics are boring mainly cuz that doesn't give insight into the biz and cuz politics is already everywhere.
Enjoying the emails so far and look forward to seeing where Oversaid goes
For #3, you may want to include guest engineering blog posts shared to highscalability [1]. There are sometimes growth hints and stats included in the posts.
Over the past few months I have been building a data pipeline that extracts quotes and assigns them to company executives, using natural language processing (spacy).
I am focusing on the tech industry right now which I know pretty well (I used to work at CB Insights). Every Wednesday I put together a newsletter featuring interesting quotes and explain why they matter.
Eventually, Oversaid will be a platform that lets you search quotes yourself and come up with your own analysis, but I wanted to launch something early while I build out the product so that I can learn from prospective customers.
I'm having a lot of fun writing these up and I hope you enjoy reading them!
Here's the last email I sent: https://mailchi.mp/ab609ce94c60/bill-gates-gives-bezos-some-...