where content, technology and people meet. (SM) Publishing and content technology executives use Shore to measure and understand their markets and competitors, define marketing strategies and implement successful content products and services using Shore's highly actionable insights into vendors, institutions, individuals and virtual communities.
ContentBlogger is the 2007 SIIA CODiE Award Winner for Best Media Blog
COMMENTARY:

Insights and headlines from Shore analysts on trends in enterprise and media content markets.
  Subscribe to our feed (?) or add to: MyYahoo  iGoogle/Google Reader  Bloglines  NewsGator  Rojo
Friday, August 25, 2006
Investor Relations magazine reports on an interesting approach to openness in Web text mining taken by Quoza.com, an online service that focuses on extracting content from the Web sites of more than 7,000 public companies and news sources covering those companies. Much of this content appears on investor relations sites, many of which are hosted by Thomson Financial's CCBN corporate communications service. When Quoza's crawlers were getting so aggressive that they started to skew CCBN's Web site report statistics they were tipped off - and ticked off enough to bar Quoza crawlers. Quoza responded with an email blitz to CCBN clients suggesting that they check with their lawyers as to whether Thomson's actions were putting their companies in jeopardy of violating U.S. Sarbanes-Oxley Act Section 409 real-time issuer disclosure regulations. Needless to say, this caused quite a stir in corporate communications circles.

It's an interesting play to protect Web crawling, but it may be on shaky legal ground. The CCBN service is already exposing content to the public, while services such as Quoza are simply helping to accelerate the redistribution of this content. Quoza provides an aggressive crawling scheme, hitting sources once each minute on a 24-hour basis. This puts it in the zone of being potentially subject to the Computer Fraud and Abuse Act (CFAA), which has been used in a number of instances to rein in aggressive access to Web sites and other electronic facilities. The real question hinges on a key phrase in SOX 409 which says that information must be disclosed by corporations to the public on "an urgent basis". Is posting something on a Web site really "urgent" distribution? If there are distributors willing to do better, shouldn't public records be available to them on an urgent basis?

While good legal teams could
push this to Thomson's favor without too much difficulty, there are reasons enough for them to rethink their approach to this situation. It's a simple enough fight to take on a renegade redistributor of public information, but what would happen if corporations with their own crawlers were excluded? That would be a tougher fight, no doubt, and a greater threat to service performance. Quoza could stand to get some marketing savvy and work with services suppliers such as Thomson to share the wealth from premium services derived from their proactive crawls so that their infrastructure costs could be born fairly. But at the same time suppliers like Thomson could get smart and recognize that there are great opportunities in distributing public information of all kinds far more aggressively than most IR site services are equipped to support. There are no clear heros or villains in this tiff but plenty of opportunity to make the most of public content.

By John Blossom - posted at 4:55 PM
permanent link to this entry        bookmark this entry:  AddThis Social Bookmark Tool
  6 comments (click to view or to add your own) 
Comments: 
I read this on Quoza's website:

"Note: Approximately 70% of The Thomson Corporation [TOC] is controlled by one Canadian family. So America news is controlled by Canadians. Do we want something this important to American fortune controlled by foreigners in today’s uncertain times." Full diatribe is here:http://www.quoza.com/jsp/whyus.jsp

How can anyone take this man seriously with this kind of argument?
 
Dominic,

Thanks very much for your comment. I think that you bring up an important point - the Quoza approach comes with a bit of an edge, as reflected by their approach to crawling as well as in their polemics. I don't think that Quoza is a very sophisticated approach to text mining by any estimate but in the process of pushing the limits they've raised some pretty interesting legal issues - some of which are not likely to favor them based on how they've put their case forward.

But at the same time Thomson and other vendors could be doing a lot more to package IR content aggressively. While RSS would probably not be the right approach from a technical standpoint, conceptually there needs to be an easier way for corporations to get a hold of this content without resorting to proprietary feeds. I think that we'll see more legal conflicts arising with the distribution of public information over the next few years.

Many thanks,
John Blossom
 
John,

I'm not sure I understand the problem that quoza solves. Indeed, I don't know that there is a problem.

The information they are scraping from companies' sites is distributed first via wire services like PR Newswire, BusinessWire, MarketWire and PrimeZone. It is also filed with the SEC, and then redistributed again via services like EDGAR Online, 10-K Wizzard and a host of others.

The materials that end up on the companies' website come via these sources, often with a slight delay. So if quoza really wants the information, they can get it from a variety of sources.

Quoza says it wants to check that companies are posting the information on their Websites in a timely manner. Well, while they should do so, there's no legal requirement for companies to even have an investor relations website. Furthemore, news releases posted on companies' websites don't constitute adequate disclosure. So they're chekcing for something that is not considered by the regulators to be a requirement.

The disclosure wires have broad distribution. They are typically picked up by hundreds of sources on the Web. I would like to see wider distribution and a more open system, but it still seems to me that current practice is aggressive distribution.

What I find interesting is Thomson Financial deciding to block access to content they host on behalf of their clients, about 1,800 public companies. I doubt they got permission to do this before hand. It raises the question of who owns the content when companies outsource their IR webpages to a vendor like Thomson.

I agree with you completely that new technologies will see more scraps like this.
 
Yes, I agree that Thomson's blocking of sites managed for clients is troublesome, especially since they're supposed to be public outlets that fulfill legal posting requirements. Agreed also that the value proposition of Quoza is rather sketchy in terms of some of the other public sources they're adding that are managed well by other sources. But in general BizIntel services will be doing more and not less of this type of aggregation...
 
In talking about control frameworks like COBIT or COSO, people often ignore or pay less attention to the monitoring component of their controls. Companies are now integrating continuous monitoring as both a control and an automated control test. For more information check out this Forrester webcast: http://www.oversightsystems.com/knowledge/view_Controls_Automation_webcast.php

Webcast with Forrester Research: Controls Automation & Continuous Monitoring

Date: Tuesday, Sept. 26

Time: 1 p.m. EDT/10 a.m. PDT

Duration: 45 minutes ngoing

Sarbanes-Oxley compliance demands controls optimization and continuous monitoring. In the first years of internal control audits, companies labored to satisfy their auditors with manual controls that were costly to implement and then required intensive testing. Forrester Research analyst Paul Hamerman will lead a 45-minute discussion on how companies can take their SOX compliance programs to the next level with controls automation and continuous monitoring. Specifically, Paul will discuss:

* Risk-based controls (and how to implement them)

* Automating compliance processes

* The role of continuous monitoring as a control and control testing

* Business benefits from compliance
 
Post a Comment
 

To top of page To Top of Page

COMMENTARY: INDEX
CONTENTBLOGGER
INDUSTRY EVENTS
CONTENT NATION

Read ShoreLines, our free weekly email newsletter.

Sample issue
Follow us on Twitter
Get headline-only feed
Buzz news comments
RECENT ENTRIES
READ CONTENT NATION

Learn how to thrive and to survive as social media changes our work, our lives and our future.
Buy the book
Read it online
Read our social media blog
WEBLOGS: ARCHIVES
 
 

shorename.gif (1190 bytes)
[HOME] [US] [SERVICES] [COMMENTARY] [RESEARCH] [EVENTS] [PRESS] [CONTACT]
Copyright © 1997-2009 Shore Communications Inc.  All Rights Reserved - Click Here to Read Terms of Use
Corporate Privacy Policy

 

 

 

 

 

 

 

 This page is powered by Blogger. Isn't yours?