Data Ingestion for Enterprise Data Platforms

May 5, 2014

http://hortonworks.com/wp-content/uploads/2014/03/Oil-and-Gas-Ref-Arch.png

The Ingestion Box in the reference architecture is displayed as the smallest box.  However, this is the component that integrates with all the available data sources.  This tends to be among the most complex and time consuming task, but tends to be relegated to a lower priority which is a big mistake.

One needs to prioritize the data sources that generate maximum value and ensure we can ingest the data into the Big Data platform for subsequent “cool” analytics.

In my experience, it is also extremely important to have a robust User Interface for the ingestion section.  Otherwise, there could be a series of manual steps leading to errors and ingestion of “bad” data that will minimize impact of subsequent analytics.

 

 


E-Discovery Integration with Archive Solutions

September 19, 2013

Why should email archiving solutions be tightly integrated with eDiscovery ?

It enables enterprises to have a smooth workflow across all aspects of e-discovery, from collection/preservation to analysis/review to production/presentation. This saves a lot of time that would otherwise be wasted on importing/exporting data from different systems, and reduces the risk that something gets lost in the shuffle.


Is Exchange 2013 ready for Litigation Hold

September 16, 2013

It is not ready according to Greg Buckles – http://ediscoveryjournal.com/2013/08/exchange%E2%80%99s-litigation-hold/

The following critical information is not held:

  • Folder Location
  • Read/Unread Status
  • Flags
  • Categories
  • Reply/Forward Info

Also check out this comment on readiness of Exchange 2013  – http://sourceoneinsider.emc.com/2013/02/15/understanding-whats-new-in-exchange-2013-for-archiving-and-ediscovery/

Exchange 2013 is unlikely to be sufficient for organizations facing a higher frequency of litigation and who have invested in in-house legal staff to perform early case assessment and sophisticated review of content before exporting such content to external legal counsel. –


IGRM model is circular for a reason

May 5, 2013

The information Governance Reference Model is circular so it works best when there is a continuous focus on IG. One of the biggest failures in Information Governance initiatives is that It is treated as a project with a start and end date. These initiatives are only successful if the end goal is to incorporate IG as part of required operational tasks.


My favorite Information Governance Commandment

March 12, 2013

Check out Bhandari’s 10 commandments – http://tinyurl.com/aaxjehy

My favorite “To Have and to Hold is Good, but not til Death Us Do Part”

Recommendation is not to keep data indefinitely because storage is cheap. When you get into dispute the cheap storage suddenly becomes expensive since you might have to collect at source and have it reviewed by lawyers.   Vendors with Predictive coding technology have the foundation to build a solution for defensible deletion and reduce the volume at source.

 


Pressure on Information Governance

December 30, 2012

Improvements in search technology, enables individuals broad and immediate access to firm and client information.   Firms may discover that minimal restrictions on document security and powerful search tools may lead to embarrassing situations.  A thorough Information Governance policy is becoming critical to organizations.

The forces driving need for an Information Governance policy include Compliance, Risk Management, Security and International laws.  Forces Information Governance advocates have to contend with include Business needs, Access to information & need for unrestricted research.

Do you have an Information Governance Framework in place?  Is that one of your firm’s Goals for 2013?