Through this visual infographics, dive into a client case study. Discover how the Harbor team collected, processed, analyzed and reviewed 100,000 emails with limited time and a small budget, and provided the responsive data to the investigations team in a matter of two weeks.
For trial lawyers, Early Case Assessment (ECA) has always been the process of quickly synthesizing information from multiple sources to craft an initial case strategy. This process typically involves working closely with the client to identify and interview key witnesses, to review important documents, and to develop preliminary discovery and litigation plans.
The explosion of electronic data in the 1990s had many eDiscovery software companies clamoring to develop ECA tools to better manage the data. These tools allowed legal teams to cull data using keywords, dates, and other file characteristics in an attempt to reduce and/or prioritize the files that require review. These preliminary instruments even provided some insight into potential discovery costs associated with litigating. But their utility was limited, compared to what the industry needed.
This ‘early data assessment’, while valuable, often didn’t fully help lawyers to access and analyze information. What data was truly useful, what was not? Which documents would play an essential role in formulating case strategies? Which were simply irrelevant or redundant?
Keywords proved to be an inefficient means of organizing data. They offered a glimmer of insight, but did not deliver a dynamic way for attorneys to understand a case, or the means to help evolve that understanding. Moreso, keyword search alone demonstrated a flawed method of locating potentially relevant files (in terms of both recall and precision). It tended to overlook too many important documents, and “hit” on far too many irrelevant ones. Too wide, too shallow.,
Analytics, fortunately, offer the potential to bridge the gap. They enhance a lawyer’s assessment of cases through scientific analysis of the data. ‘Scientific’ being the operable word.
Today, metrics, correlations, associations, occurrences, and algorithms have come to the forefront. (Well, maybe behind the scenes.) When deployed at the ECA stage, analytics can not only inform the development of early case strategy, but it can also provide a more sophisticated means of culling data for review. This makes it great for estimating and reducing overall discovery costs, as well.
Standard features found in most eDiscovery analytics tools offer these functions:
1) Conceptual Clustering – Documents are analyzed based on their text, and then complex algorithms group documents together based on their conceptual similarity. Now, related items and topics start to cluster together, for easier observation. Even though the words within the document may be different, clustering will still group documents together, if they are conceptually similar.
Practical Uses of Conceptual Clustering:
2) Key-Term Expansion – This tool first identifies conceptually related terms found in your content, and then ranks them in order of relevance. The user dictates the status, grade and order of subjects.
Start with a keyword. The tool provides a list of similar, or very related, terms. The results allow reviewers to expand the search to include documents containing other near or related terms.
For example, a search for “President Roosevelt” might produce a list such as: Theodore Roosevelt, Teddy Roosevelt, Theodore Roosevelt Jr., Franklin Delano Roosevelt, FDR, Commander-in-Chief, Vice President Roosevelt, Senator Roosevelt, Assemblyman Roosevelt, Eleanor Roosevelt, the Oval Office, Office of the President, POTUS, etc.
When using key-term expansion, a reviewer searching for important documents based on keywords can conduct a much more comprehensive and defensible search. This expansion of terms will produce more meaningful and trustworthy results.
3) Conceptual Search – The tool finds documents conceptually related to a known term or phrase. Comparable documents get grouped together by their correlated concept.
Imagine you’ve located a key phrase or paragraph. Now you want to find similar ones that correspond to it. Concept searching will hunt for and assemble conceptually similar documents – even if they don’t contain that exact same term(s) used in the initial search. These are documents that would not be found with keyword searching. At the same time, concept searching eliminates false positives from synonyms and polysemes. An attorney can quickly zero in on top priority documents for immediate review.
4) Email Threading – Email threading identifies emails that were once part of the same email thread (or conversation).
5) Near-Duplicate Identification – Deduplication removes documents that are 100% duplicative, but what happens when they’re only 99% similar? Near Dupe (ND) detection will identify documents that have the same words, in the same order, and group them together. This has nothing to do with conceptual similarity – it’s a literal approach to similarity. So, those emails you get every morning from Yahoo Finance that have almost exactly the same text but with a few slight differences … they’ll be grouped together.
6) Computer Assisted Review (“Predictive Coding” or “Technology Assisted Review”) – The goal of computer assisted review is to train the analytics tool to make consistent, reliable responsiveness decisions on large sets of data. This can vastly reduce the volume of documents human review for production.
Harbor’s ECA workflow leverages a processing engine that’s fully integrated into our Relativity environment. It reduces the time it takes to get access to the documents, and it provides those documents in a familiar review format. Once our system ingests data, the reviewer has access to a host of traditional features such as keyword search, reporting, and powerful culling strategies that include deduplication and de-NISTing. This workflow also offers advanced options like data visualization, near-duplication detection, data pivoting, sampling, email threading, clustering and conceptual searching.
Brainspace powers Harbor’s analytics offering and enables a truly unique analytics experience. It dynamically links multiple views of data that encompass: Overview Dashboard, transparent concept search, timeline, document clusters, communication analysis, and structured data facets.
Robust tools reveal the story inside your data by using powerful, interactive visualizations–even with the largest datasets. Our Dashboard, Focus Wheel, and Communication Network Graph all link together dynamically to provide multiple perspectives on any data set, or sub-data set.
Truly transparent Concept search gives reviewers in ECA complete control over the power of analytics, while helping them maintain a clearer understanding. It takes the guess work out of concept expansion, and delivers a versatile, defensible platform for attorneys.
State-of-the-art social network visualization enables users to effortlessly navigate the social media graph. It reveals the content and context of conversations, posts, direction of information flow, CC, BCC, and powerful, simple, alias consolidation.
Our unique approach to document classification incorporates multiple active learning methods to accelerate system training, depth and recall for planning and cost analysis, and delivers best-in- class matching results. Review less and decrease costs.
Contact Harbor Litigation, today, to see how our customizable ECA workflows can accelerate case understanding, defensibly reduce data sets, and significantly lower review costs.
Managing eDiscovery in the cloud is in the future for many organizations; but for others it’s the present. Managed eDiscovery offers law firms, corporations, and government entities the tools to control both costs and processes throughout the eDiscovery lifecycle.
There are four major components to eDiscovery operations: people, processes, software, and hardware. Managed eDiscovery allows your people to implement your processes, utilizing vendor software and hardware to run your operations. When the need arises, you have access to the vendor’s expertise. In some cases, you can license software yourself, and install it on the vendor’s hardware for your use.
In a nutshell, managed eDiscovery gives you your own customized eDiscovery solution without the capital outlays, maintenance, upgrades and personnel commitments required to build it yourself.
In the recent past, most legal departments made a choice between vendor-reliance and building in-house eDiscovery capabilities. When in-house capacity was insufficient, the legal department outsourced overflow to vendors.
Many companies found vendor-reliance unacceptable. Cost-predictions were often futile pricing models, compressed data and lack of communication frequently led to invoices that far exceeded estimates. Vendor workflows didn’t always mesh with in-house processes, and “black-box” vendor services caused uncertainty and frustration in setting and meeting expectations.
In response, some legal departments sought to build their own internal eDiscovery capability. This approach had the advantage of process and workflow control. In addition, companies were able to realize cost savings, and some law firms managed to create profit centers from their eDiscovery services.
However, the required investment in technology and expertise made in-house eDiscovery too expensive for the majority of companies and firms. Others made business decisions not to go the in-house route to limit risk exposure or to focus on core offerings. Yet companies and firms without robust litigation support departments found themselves at a competitive disadvantage, and largely powerless to exercise any control over escalating eDiscovery costs.
Even the companies who did build eDiscovery departments are revisiting their in-house
model because of certain market realities:
Managed eDiscovery presents an alternative “hybrid” option for companies who outsource to vendors, as well as for companies with in-house capability. Companies with in-house litigation support departments lose nothing by adding managed services. They still leverage their experience and knowledge on future matters, maintain their existing workflows, and exercise control over their data. And they gain much lower costs without capital investments, the advantage of rapid scaling, and the ability to outsource services if and when they want to.
Managed eDiscovery is a combination of cloud computing and support services. Cloud computing is a collection of technologies that allow access to computing power through the internet, instead of an organization’s server room. Managed eDiscovery takes primary advantage of two cloud computing technologies: Software as a Service (SaaS) and Infrastructure as a Service (IaaS).
Any software application accessible as a web page is considered SaaS. SaaS is commonly used in the legal industry for hosted review. In a pure SaaS model, the software is licensed by the vendor who also takes responsibility for all maintenance including upgrades, patches, security and redundancy. If you need your storage to quickly spike up, your SaaS vendor can ramp up your storage allocation, usually without interrupting existing processes. You pay for the additional storage only for as long as you need it.
IaaS grants customers access to servers, routers, storage, and other computing infrastructure over the internet. These services allow companies to utilize the internet for scalable storage and processing cycles. The infrastructure is similar to co-locating equipment at an offsite data center, except you don’t have to buy the equipment. Instead you only pay for what you use, and the environment can be scaled up or down to match the uneven workflow common in e-Discovery.
Typically, an organization uses in-house resources to handle eDiscovery phases through (or up to) collection. After collection, data is uploaded to the service provider’s data center. Some vendors offer high speed FTP (or FTP-like) transfer options while large data sets are often shipped directly to the data center. Your in-house technicians can take over from there and handle any or all phases from processing through production, including the setup and project management of hosted review databases.
With Managed eDiscovery, your technicians and project managers can log into software hosted in a secure data center and perform as much, or as little, of the actual data manipulation and project management as you choose. The service provider fills in the gaps and provides technical assistance. The software can be licensed by the vendor or by you.
For corporations, Managed eDiscovery allows attorneys to push all matters through the company’s workflows in a centralized location, collaborating with outside counsel wherever they’re physically located. Data can easily be harvested once and then used in multiple matters, replicating privilege and redaction calls where appropriate.
Additionally, organizations may find it easier to budget for Managed eDiscovery, as capital expenditures typically require more layers of approval and more advance notice than an expense budget. It’s also easier to manage and predict costs and return on investment with the monthly billing of Managed eDiscovery, instead of the startup costs, depreciation, and labor associated with buying and maintaining your own hardware and software.
Perhaps most importantly, Managed eDiscovery reduces stress on your internal systems and the people who maintain them.