eDiscovery solutions help companies retrieve electronic data so that they can support lawsuits with documentary evidence at the right time. These data include e-mails, Microsoft Word and PDF files, spreadsheets… as well as intranet and extranet content. The Web is the new way to publish data, and discovering data published on the Web (intranet, extranet) is a growing need for the many companies involved in disputes such as international lawsuits or discrimination complaints. To respond quickly to requests from the authorities, companies must be able to retrieve and present all electronic data stored on their desktop and laptop computers, servers, and storage and archiving systems. Websites are part of that data.
With CAMA®, you can now collect, organize, archive (for long-term preservation) and access your Web content even when it is no longer on-line. Our crawl platform lets you archive part or all of your websites and store them in the standardized WARC format (ISO 28500:2009) for later research and reference.
“Web eDiscovery” solutions from Aleph Archives are fully compliant with the “Electronic Discovery Reference Model” (EDRM) guidelines and standards. They let you:
- Archive any Web content in its original form, whatever technology it uses and without any alteration (authentic archives), so that it can be presented as evidence in a lawsuit.
- Use an autonomous, multi-user, multi-site platform that is free of proprietary technologies.
- Easily browse through your Web archives as they were on the day you saved them.
- Search your archives thoroughly for specific content.
- Run analysis functions to use archived data efficiently.
- Export analysis results as e-mails or files.
Litigation Support & Protection
Thanks to Aleph Archives’ “Web eDiscovery” data management technology, you will be able to control all of your company’s Web-based data, reducing risks and costs while increasing productivity and the quality of your intelligence.
The CAMA® platform is a reliable Web archiving system that can be customized to your specific needs, whether you are a company, a law firm or a government agency. This is made possible by unlimited combinations of data collection, processing and automatic examination modules. Some systems come preconfigured to meet the requirements of today’s companies.
“Technology initiatives can help law firms gain and keep a competitive advantage. Electronic litigation support systems can cut the time spent on document-intensive tasks, reduce costs, and promote efficiency.”
May 2003 – Kelvin McGregor-Alcorn, Oyez Legal Technologies
The CAMA® platform lets legal and IT departments within companies handle the most expensive and tedious aspects of litigation themselves, and efficiently. The overall cost of lawsuits is thus minimized, and thorough eDiscovery data management helps external legal advisers avoid being sued.
CAMA® is based on the EDRM (Electronic Discovery Reference Model) and lets you:
- Automate the identification and gathering of Web-based data ahead of its storage and legal preservation.
- Gather, manage and organize data ahead of an initial automated examination and a thorough legal examination.
- Use reliable search, sort-by-relevance, filtering and deduplication tools for more efficient examinations.
- Access a simple and intuitive interface for external legal examinations.
- Produce audit trails and reports to show that a defensible methodology was used.
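The deduplication and audit items in the list above can be illustrated with a small sketch. It is not CAMA®'s implementation: it assumes that documents are (URL, bytes) pairs, that two documents are duplicates when their SHA-256 hashes match, and that an audit entry is a plain dictionary. The function name `deduplicate` and the example URLs are hypothetical.

```python
import hashlib
from datetime import datetime, timezone


def deduplicate(documents):
    """Keep the first document seen for each distinct content hash.

    `documents` is an iterable of (url, content_bytes) pairs; this shape is
    an assumption made for the example.
    Returns (unique_documents, audit_log).
    """
    seen = {}        # content hash -> URL of the first document with that content
    unique = []
    audit_log = []
    for url, content in documents:
        digest = hashlib.sha256(content).hexdigest()
        duplicate_of = seen.get(digest)
        if duplicate_of is None:
            seen[digest] = url
            unique.append((url, content))
        # Every decision is logged, so the method can be reviewed afterwards.
        audit_log.append({
            "url": url,
            "sha256": digest,
            "action": "kept" if duplicate_of is None else "dropped",
            "duplicate_of": duplicate_of,
            "logged_at": datetime.now(timezone.utc).isoformat(),
        })
    return unique, audit_log


# Hypothetical sample data: the first two pages have identical content.
docs = [
    ("https://example.com/a", b"<p>Terms of use</p>"),
    ("https://example.com/b", b"<p>Terms of use</p>"),
    ("https://example.com/c", b"<p>Privacy policy</p>"),
]
unique_docs, log = deduplicate(docs)
```

Logging the dropped documents alongside the kept ones is the design choice that matters here: a reviewer can see what was excluded from examination and why.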
Regulatory compliance & Web Records Keeping
The CAMA® platform extends the capabilities of other litigation support and compliance software to the Web. It allows companies to comply with the new laws governing their employees’ activity on social networks.
The fast adoption of social networks by employees, clients, advertisers, bloggers and information groups raises new challenges for many companies. Regulators are aware of the influence and risks associated with these new tools, and are starting to demand that companies manage and monitor their employees’ activity on social networks.
New legal requirements concerning the Web and social networks add even more constraints to the already complex task of complying with the law. Aleph Archives helps companies stay compliant as the legal requirements covering their various communication channels keep evolving.
According to recent research, the average life expectancy of a website is less than 75 days, and disputes over the content of websites are on the increase. In a number of countries, regulatory and archiving compliance rules (e.g. Sarbanes-Oxley Act - US, Health Insurance Portability and Accountability Act - US, Gramm-Leach-Bliley Act - US, Federal Rules of Civil Procedure - US, etc.) govern the different industry sectors, and authorities established under them (e.g. SEC and FINRA - US, Financial Services Authority - UK) supervise those sectors.
For just about every organization, there is a large and growing number of regulatory obligations to preserve electronic content. Some of the more important requirements are:
- Sarbanes-Oxley Act of 2002
- Health Insurance Portability and Accountability Act of 1996 (HIPAA)
- Securities and Exchange Commission (SEC) rules
- Financial Industry Regulatory Authority (FINRA) rules
- Model Requirements for the Management of Electronic Records (MoReq)
Business Intelligence goals
Business Intelligence (BI) solutions give users immediate access to reliable information so that they can make key management decisions more quickly and efficiently. They enable users to find and analyze data about companies and their sector, which is a basic requirement on the worldwide market. But these data are of use only once they have been processed and passed on to the appropriate decision-makers.
Companies of every size know about the impact of BI solutions. They make it possible to process huge quantities of data and extract relevant information. Relevant information (“knowledge”) can then be redistributed to people and departments for better decisions.
CAMA® takes part in the four main decision-support steps in BI:
- Data extraction (Web scraping): To obtain significant results, one must gather Web-based data wherever it resides. When connected to Web data sources, CAMA® gathers relevant data and centralizes it in its distributed data warehouse.
- Consolidation: Once centralized, data must be analyzed and distributed inside the data warehouse. This preprocessing makes it easier for CAMA® tools to access data, since data warehouses are automated.
- Processing: Starting from a query submitted through dedicated search forms, the analysis tool collates related data to find relevant information.
- Reporting: This step is about distributing and presenting value-added information so that it is as readable as possible for the decision-maker.
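As a rough illustration of how the four steps above fit together, here is a minimal sketch. It makes no claim about how CAMA® works internally: extraction is reduced to stripping tags from in-memory HTML, the "data warehouse" is a simple inverted index, and all function names and sample pages are hypothetical.

```python
import re
from collections import defaultdict


def extract(pages):
    """Step 1: turn (url, html) pairs into (url, text) by stripping tags."""
    return [(url, re.sub(r"<[^>]+>", " ", html)) for url, html in pages]


def consolidate(records):
    """Step 2: organize the text into an inverted index (word -> set of URLs)."""
    index = defaultdict(set)
    for url, text in records:
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(url)
    return index


def process(index, query):
    """Step 3: return the URLs that contain every word of the query."""
    words = re.findall(r"[a-z0-9]+", query.lower())
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


def report(query, urls):
    """Step 4: present the results in a readable form."""
    lines = [f"Query: {query} ({len(urls)} match(es))"]
    lines += [f"  - {url}" for url in sorted(urls)]
    return "\n".join(lines)


# Hypothetical sample pages standing in for collected Web content.
pages = [
    ("https://example.com/pricing", "<h1>Pricing</h1><p>Annual plan discount</p>"),
    ("https://example.com/news", "<p>New annual report published</p>"),
]
index = consolidate(extract(pages))
matches = process(index, "annual plan")
summary = report("annual plan", matches)
```

Keeping each step as a separate function mirrors the description in the list: the index is built once during consolidation, and every later query reuses it.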
The two main obstacles to preserving a digital document are the fragility of storage media and technical obsolescence. Website archiving also raises the problem of selecting and gathering data from complex websites (national libraries, for example).
As digital documents, websites are particularly complex. They are made up of several types of media: text, images, sounds, animations and streams.
There are two approaches to archiving a website:
- Manual or semi-automatic archiving, which aims at collecting sites according to a determined selection.
- Automatic archiving with the help of indexing robots (crawlers). These robots visit websites on a regular basis and capture them either partially or totally (as snapshots).
The CAMA® Web collection platform belongs to the latter category and aims to achieve the following:
- CAMA® robots collect contents regardless of internal website architectures and publishing platforms.
- CAMA® guarantees the authenticity of your stored data, which remains unaltered by human hands.
- CAMA® lets you use your archives with other analysis tools (such as linguistic software), as it is based on an open, standardized WARC storage format (ISO 28500:2009).
- CAMA® offers friendly and natural access to your archives. Access servers are available within your infrastructure or on-line as PaaS (Platform as a Service).
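To make the open-format and authenticity points above more concrete, the sketch below builds a single WARC 1.0 `resource` record by hand, reads it back, and checks its block digest. It is an illustration under stated assumptions, not CAMA®'s code: the helper names are hypothetical, only a handful of header fields are written, and the digest is a Base32-encoded SHA-1 as commonly seen in WARC files. Production archives would normally be written and read with a dedicated WARC library.

```python
import base64
import hashlib
import uuid
from datetime import datetime, timezone


def sha1_base32(data):
    """Digest in the 'sha1:<base32>' style commonly used in WARC headers."""
    return "sha1:" + base64.b32encode(hashlib.sha1(data).digest()).decode("ascii")


def build_resource_record(target_uri, content_type, payload):
    """Serialize one WARC 1.0 'resource' record (hypothetical helper)."""
    warc_date = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    headers = [
        "WARC/1.0",
        "WARC-Type: resource",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",
        f"WARC-Date: {warc_date}",
        f"WARC-Target-URI: {target_uri}",
        f"WARC-Block-Digest: {sha1_base32(payload)}",
        f"Content-Type: {content_type}",
        f"Content-Length: {len(payload)}",
    ]
    # Header lines end with CRLF, a blank line separates header and block,
    # and the record is terminated by two CRLFs.
    return "\r\n".join(headers).encode("utf-8") + b"\r\n\r\n" + payload + b"\r\n\r\n"


def parse_record(record):
    """Split a single serialized record into (version, fields, block)."""
    head, _, rest = record.partition(b"\r\n\r\n")
    lines = head.decode("utf-8").split("\r\n")
    fields = dict(line.split(": ", 1) for line in lines[1:])
    block = rest[: int(fields["Content-Length"])]
    return lines[0], fields, block


def block_is_unaltered(fields, block):
    """Recompute the digest and compare it with the one stored in the header."""
    return sha1_base32(block) == fields["WARC-Block-Digest"]


record = build_resource_record(
    "https://example.com/", "text/html", b"<html><body>Hello</body></html>"
)
version, fields, block = parse_record(record)
```

Because the digest travels inside the record, any tool that can read WARC can recheck it later, which is one way an archive can show that stored content has not changed since capture.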