Timeline: 100 weeks Amount: 10,001 to 50,000
Business domains: Media, Social
Expertise: Behavioral analytics, Predictive analytics
Technology and tools: Python, SQL & noSQl, R, mlLib, Spark
One of the most renowned EU&USA publishers reached out to us with an intention to build a new value-added service for their subscribers focused on prediction of a bill-voting outcome in US congress. Such service was offering an innovative, impartial and accurate approach to political analytics, as opposite to classic “expert” approach used by other players on the publishing market.
The journey began with deep analysis of the problem and available data to solve it. The main challenge was that in order to get high accuracy the machine learning model had to gather attributes of a bill and voting habits of senators from multiple non-congruent sources. Added complexity was the fact that both “bill” and “voter” are complex entities and specific voter decision regarding specific bill depends on multiple attributes and their combinations, therefore requiring advanced approaches for prediction.
Long story short – the model had an accuracy of 84% proving the business case. Such successful proof of concept initiated series of new initiatives and value-added services based on the data existing in the organization and utilizing the power of data science and machine learning to gain unique insights from it. This solution became a capstone for a versatile solution allowing users splice&merge datasets and predicting not only bill-voting outcome but a precursors leading to the decision of even factors, affecting behavioral pattern of a particular congress member.