FINIF mini review: Part 2
Jinfo Blog
7th March 2012
Abstract
In Part 1 of this review, we looked at the basic premise behind FINIF, a financial informatics platform.
FINIF is not only unique in that it is one of a very few in the financial sentiment analysis niche...
Item
In Part 1 of this review, we looked at the basic premise behind FINIF, a financial informatics platform. FINIF is not only unique in that it is one of a very few in the financial sentiment analysis niche (see also the recent Opfine review by Scott Brown – http://web.vivavip.com/go/livewire/66640) but the creators, University of Notre Dame Professor Dr Bill McDonald and his son, FINIF CEO Michael McDonald, have also created a proprietary dictionary of key words that is specific to filings that US companies must make to the Securities and Exchange Commission (SEC).
In short, McDonald discovered that many of the sentiment words used in analysis platforms came from psychological and sociological based dictionaries, such as the Harvard Dictionaries. This often leads to misinterpretation when it comes to business or financial documents. Often, a term that may have a negative connotation in the marketing world, such as “vice”, is used rampantly in a 10-K filing to describe vice-president. Add that to the practice in sentiment analysis of providing a great deal of weight and focus to what are considered negative words, and you have a recipe for misguided financial sentiment analysis.
As a matter of fact, Dr McDonald even co-authored an article in 2009 for the Journal of Finance entitled When is a Liability not a Liability: Textual Analysis, Dictionaries and 10-Ks (http://www.afajof.org/afa/forthcoming/6989p.pdf) (PDF). In the article, McDonald describes the results of sampling 10-Ks during 1994 to 2008, and finding that almost three quarters of the words identified as negative by the Harvard Dictionary were words not considered negative in financial contexts. Some examples include: tax, capital, board, foreign, order and depreciation.
In order to combat this problem for financial sentiment analysis, the McDonalds and other colleagues sat down and went through each word in a dictionary and identified those that they felt would most frequently be seen as reflecting a particular type of sentiment or attribute. They then back-tested the lists by creating word counts based on all 10-K filings and carefully examined words that seemed misclassified, typically due to alternative uses.
Dr McDonald has also created six dictionaries behind a Dow Jones product called Lexicon, which uses the proprietary sentiment dictionary to create custom sentiment models. It is actually financial information and data providers, such as Dow Jones, that have expressed the most interest in FINIF, aside from investment and hedge fund managers.
Though FINIF had at one time operated on a subscription model, it has since pulled back in order to further build out the database and possibly partner with others in order to scale the business. So, I would say you could classify it as in post-Beta development.
FINIF certainly has taken on a niche in sentiment analysis and drawn some attention. It will be interesting to see how things unfold for the McDonalds and FINIF in the near future.
- Blog post title: FINIF mini review: Part 2
- Link to this page
- View printable version
- FINIF mini review: Part 1
Tuesday, 6th March 2012
Community session
11th December 2024
2025 strategic planning; evaluating research reports; The Financial Times, news and AI
5th November 2024
How are information managers getting involved with AI? Navigating privacy, ethics, and intellectual property
- 2025 strategic planning; evaluating research reports; The Financial Times, news and AI
5th November 2024 - All recent Jinfo Subscription content
31st October 2024 - End-user training best practice research
24th October 2024
- Jinfo Community session (TBC) (Community) 23rd January 2025
- Clinic on contracting for AI (Community) 11th December 2024
- Discussing news and AI strategies with the Financial Times (Community) 21st November 2024