Tag Archives: collect

Defining Truly Cloud-Capable eDiscovery Software

Last week we discussed the challenges of searching and collecting data in Infrastructure as a Service (IaaS) cloud deployments (such as the Amazon cloud or Rackspace) for eDiscovery purposes.  Today we discuss what is needed for eDiscovery and enterprise search vendors to provide a truly cloud-capable solution and provide a decoder ring of sorts to cut through the hype.  For there is a lot of hype with the cloud becoming the latest eDiscovery hot button, with vendor marketing claims far surpassing actual capabilities.

In fact, many eDiscovery and enterprise software vendors claim to support the cloud, but are simply re-branding their long-existing SaaS offerings, which really has nothing to do with supporting IaaS. Barry Murphy of the eDiscovery Journal aptly identified this marketing practice as “cloud washing.” Data hosting, especially where the vendor’s manual labor is routinely required to upload and process data, does not meet defined cloud standards. Neither does a process that primarily exports data through APIs or other means out of its resident cloud environment to slowly migrate the cloud data to the vendor tools, instead of deploying the tools (and their processing power) to the data where it resides in the cloud. In order to truly support IaaS cloud deployments, eDiscovery and enterprise search software must meet the following three core requirements:

1.         Automated installation and virtualization:  The eDiscovery and search solution must immediately and rapidly install, execute and efficiently operate in a virtualized environment without rigid hardware requirements or on-site physical access. This is impossible if the solution is fused to hardware appliances or otherwise requires a complex on-site installation process. As hardware appliance solutions by definition are not cloud deployable and with enterprise search installations often requiring many months of man hours to install and configure, whether many of these vendors will be able to support robust IaaS cloud deployments in the reasonably foreseeable future is a significant question.

2.         On-demand self-service: In its definition of cloud computing, The National Institute of Standards and Technology (NIST) identifies on- demand self-service as an essential characteristic of the cloud where a “consumer can unilaterally provision computing capabilities, such as server time and network storage, as needed automatically without requiring human interaction with each service provider.”

Many hosted eDiscovery services require shipping of data to the provider or extensive behind the scenes manual labor to load and configure the systems for data ingestion. Conversely, solutions that truly support cloud IaaS will spin up, ingest data and fully operate in an automated fashion without the need for manual on-premise labor for configuration or data import.

3.         Rapid elasticity: NIST describes this characteristic as capabilities that “scale rapidly outward and inward commensurate with demand. To the consumer, the capabilities available for provisioning often appear to be unlimited and can be appropriated in any quantity at any time.” This important benefit of cloud computing is accomplished by a parallelized software architecture designed to dynamically scale out over potentially several dozen virtualized servers to enable rapid ingestion, processing and analysis of data sets in that cloud environment. This capability would allow several terabytes of data to be indexed and processed within 2 to 4 hours on a highly automated basis at far less cost than non-cloud eDiscovery efforts.

However, many characteristics of leading eDiscovery solutions fundamentality prevent their ability to support this core cloud requirement. Most eDiscovery early case assessment solutions are developed and configured toward a monolithic processing schema designed to operate on a single expensive hardware apparatus. While recently spawning some bold marketing claims of high speeds and feeds, such architecture is very ill-suited to the cloud, which is powered by highly distributed processing across multitudes of servers. Additionally, many of the leading eDiscovery and enterprise search solutions are tightly integrated with third party databases and other OEM technology that cannot be easily decoupled (and also present possible licensing constraints) making such elasticity physically and even legally impossible.

So is there eDiscovery software that will truly support the IaaS cloud based upon these requirements, and address up to terabytes of data?  Stay tuned….

Leave a comment

Filed under Cloud Data, Enterprise eDiscovery, IaaS

Can Lawyers Be Disqualified by Merely Viewing a Linkedin Profile? The Implications of Indirect Social Media Communications and Legal Ethics Rules

With attorneys and their hired consultants routinely collecting social media evidence for investigation and eDiscovery purposes, it is important to be aware that such activity can generate various direct and indirect communications to the subject account owners.  Sending a Facebook “friend” or a LinkedIn “connect” request are obvious examples, but there are also less overt means of social media communications. For instance, if a hypothetical law firm named Smith & Wesson were to merely follow a witness on Twitter, the service will automatically email the witness with a notification that Smith & Wesson is now following her. Additionally, it is all too easy when viewing a Facebook page to inadvertently “like” an item or accidentally send a friend request through a single mouse click.  And if you simply view another’s Linkedin profile while logged into your own account, that person will often be notified that you viewed his or her profile page.  Ethical Implication

For lawyers and their hired consultants and investigators, all this can be very problematic considering legal ethics rules that strictly regulate communications with represented parties and even jurors connected to a case. Several local and state bar associations have issued legal ethics opinions discussing this issue specific to collecting social media evidence. On December 6, X1 Discovery hosted a live webinar to delve deeper into this topic with the esteemed Ralph Losey of Jackson Lewis as the featured speaker. Ralph is the lead eDiscovery partner at Jackson Lewis and the author of “The eDiscovery Team,” considered by many to be the best legal eDiscovery blog on the planet. You can register for the recorded version of this webinar at this link here. (One hour of ethics CLE credit will be available to California attorneys).

From our perspective, this critical concern involving indirect social media communications and legal ethics underscores the importance of employing best practices technology to search and collect social media evidence for investigative and eDiscovery purposes.  Collecting evidence in a manner that prevents, or at minimum, does not require that attorneys and their proxies directly or indirectly communicate with the subjects from whom they are collecting social media evidence is a core requirement for solutions that truly address investigative and eDiscovery requirements for social media. If user credentials to the social media account have been properly obtained, that is obviously ideal. However, in many instances lawyers must resort to searching and collecting publicly available information. In such situations, it is crucial that the law firm and/or its hired experts conduct such collections in the proper manner.

For instance, X1 Social Discovery software features public Facebook capture that can search and collect publicly available Facebook pages without directly or indirectly notifying the account holder. This is critical functionality for eDiscovery preservation. Additionally, X1 Social Discovery accesses and displays Facebook pages in read-only mode, preventing metadata alternation, inadvertent friend requests or “like” tagging through a simple slip of the mouse. X1 Social Discovery includes other features concerning Twitter and Linkedin that also prevent indirect communications while effectively collecting data from those sites. We will be highlighting those features in the next few weeks, but in the meantime, we hope you enjoy our webinar.

3 Comments

Filed under Best Practices, Legal Ethics & Social Media