To Buy to Sell - this is the Question


When we at FSWIRE first set out on this journey into social data for finance, we worked on the premise that social data could answer any financial question and predict the future. A financial crystal ball so to speak. Our approach, to use social sentiment to predict stock price movement in hindsight this was hugely naive. Yet inspired by research triggered by Johan Bollen and Huina Mao, professors of informatics and computing at Indiana University-Bloomington, among others (see Social Mood ConferenceBattle of the Quants). Adding fuel to the fire was Paul Hawtin launching Dement Capital Markets and their alleged $40M investment fund licensing the work from Bollen. So there must be something to it - right? So we ventured on.

There has been much copy covering the aforementioned entities so I will let you draw your own conclusion about whether this was hype, tripe or ... well you get the idea.

Anyway it is not as straight forward as it would initially seem. Sentiment of short, medium and long form text is a commodity service which is easy to engineer (if you feel the need I would suggest you start with Bayes and/or Support Vector Machines); we have rolled our own but this is more historic than anything. Anyhow sentiment alone does not produce the insight required for individuals to make decisions with, more-over one needs to look at all dimensions of the data, and fundamentally ensure that data you are analysis is suitably curated. This is a key aspect in all research done to date, for example Bollen manually manipulated the source dataset, removing content that can affect the results which is common practice.

So somewhat covertly the problem shifts from deriving sentiment analytics on social content, to one of accurate real-time curation of context sensitive content on which to generate analytics. Therefore we are now deciding what is relevant and applicable to a particular financial entity within a specific vertical/context.

Easy peasy - well not so fast man, this is fine if we know what type of content is applicable which is simple to model statically, but the interesting stuff happens dynamically and more often then not we only know it is relevant after the crucial event has occurred. So the bigger question is how do we find content that we do not know about and define a model which leverages the context and time sensitive data for the domain and entity we are interested in. Ah trending I hear you say, hmm no say I, it is much more complex than that.

As an aside there was some research released by professors at MIT showing a new trending algorithm that was identifying trends before twitters own. My only comment it is easy to identify breaking trends before they break if you know they are going to trend!

Hindsight is a wonderful thing which is a luxury we do not have!

So I guess you could say we have pivoted, kind-of. Our focus over the last 6 months has been in the following areas

  1. Improving our Financial curation models
  2. Event detection and correlation to financial entities
  3. Social Strength Indicators (SSI ®), which uses stock price to measure the impact of social data, breaking news and unknown futuristic events on a companies stock price

SSITo this end we have just added our custom SSI measure to the summary chart which indicates how well our model relates to stock price based purely on social data, it's authority and dissemination throughout the Internet in real-time which is kinda cool! What is more interesting is how this measure closely tracks price by just using our analysis of social data.

Pop over and see how your favourite stocks are doing.

blog comments powered by Disqus