Our Insights

Thought Leadership and Industry Trends

Home 9 Insights 9 How Are You Dealing with eDiscovery of Your Internet Content?

How Are You Dealing with eDiscovery of Your Internet Content?

Aug 8, 2018

Discovery isn’t restricted to data found on file servers, laptops and phones anymore. Increasingly, data is being collected from the internet, which poses unique challenges. Courts are grappling with how much and when such data is discoverable, leaving companies to determine how to collect and preserve the information in case it is needed. Types of web content Internet content includes websites, social media and videos. In addition, the Internet of Things is fast becoming an important source of data. The use of or need to connect everything to the internet continues to increase year over year. There are a range of applications that utilize technology to communicate and interact with others via the internet, such as household appliances, security systems, smart watches, cars, medical equipment and other devices that store and receive or transmit data. These data sources provide a wealth of information that may be relevant to a litigated matter. For example, social media can provide data on employment, residential history, social media mentions, geolocation, profiles, groups/organizations, or authored content like blogs, YouTube videos, reviews, etc. The most popular social media sites are Facebook, LinkedIn, Twitter, YouTube and Instagram, but there are many others that litigants could potentially mine for eDiscovery purposes. eDiscovery challenges Collecting internet data presents significant issues due to its size, difficulty in collecting it and retaining it for possible future litigation. These concerns are compounded by the fact that this data consists of dynamic content.

Finding and preserving the data. Attorneys are typically challenged by having to find the data they need. There are applications that help identify devices connected to the internet like Shodan but that may not be enough to locate all sources. One of the biggest issues with collecting internet data is that it isn’t static. Most web content is fluid and ever-changing, plus it is easily deleted or overwritten. Some applications regularly store data in storage systems or allow the end-user to make their own determination if they want to store their data. However, the application’s data retention policies might not meet discovery requirements. This poses another problem: how to collect information from devices that don’t have the ability to protect data if needed on a litigation hold?
Capturing data. Companies must understand how and when to capture internet data. Maintaining proper digital chain of custody – that is, establishing that electronic evidence presented to the court is the same as what was originally collected – is crucial. This is difficult because electronic information is easily changed or deleted, and the record of the changes can be hard to document. That leaves such evidence potentially vulnerable to legal challenges.

When data is captured, its metadata must be captured as well. Time and date of the collection is critical since the data isn’t static. In addition, website URL, IP addresses, source code and other information may also be relevant. The universe of possible data locations that might hold relevant information is dramatically expanding. Companies must be aware of these issues because internet data has become a major component of virtually every type of litigation. It is more important than ever to have a team of experts on your side to advise on creating a plan to mitigate the risks associated with these data sources. To learn more about how CDS can help with data collection and preservation, contact us for a consultation.

About the Author

Regina Chepalis

As Vice President of Sales, Regina Chepalis leads the CDS global sale team focused on offering technology-enabled solutions to a rapidly changing eDiscovery market. Regina has over 40 years of leadership experience in the legal service industry working for service providers, law firms and governmental agencies in a variety of roles.

23 April 2024

Relativity AI Bootcamp: Atlanta

Relativity is kicking off a third season of AI Bootcamps on April 23-24 in Atlanta, where CDS’ Director of Advanced Analytics & Data Privacy Danny Diette will be a featured panelist.

Find out more

01 May 2024

7th Annual Putting Insights into Practice Forum

Navigate a virtual journey through today’s biggest legal data management challenges at PIIP 2024: ADVENTURES ON THE DATA CONTINUUM

Find out more

Cookie	Duration	Description
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
bcookie	2 years	This cookie is set by linkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
lang	session	This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc	1 day	This cookie is set by LinkedIn and used for routing.

Cookie	Duration	Description
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gcl_au	3 months	This cookie is used by Google Analytics to understand user interaction with the website.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.
_hjFirstSeen	30 minutes	This is set by Hotjar to identify a new user’s first session. It stores a true/false value, indicating whether this was the first time Hotjar saw this user. It is used by Recording filters to identify new user sessions.
oktgid	1 year	This cookie is used for storing the visitor ID of the user who clicked on an okt.to link.
oktsid		This cookie is used for storing the session ID of the user who clicked on an okt.to link.
pardot	past	The cookie is set when the visitor is logged in as a Pardot user.
vuid	2 years	This domain of this cookie is owned by Vimeo. This cookie is used by vimeo to collect tracking information. It sets a unique ID to embed videos to the website.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to deliver advertisement when they are on Facebook or a digital platform powered by Facebook advertising after visiting this website.
bscookie	2 years	This cookie is a browser ID cookie set by Linked share Buttons and ad tags.
fr	3 months	The cookie is set by Facebook to show relevant advertisments to the users and measure and improve the advertisements. The cookie also tracks the behavior of the user across the web on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Duration	Description
_dc_gtm_UA-109542572-2	1 minute	No description
_hjAbsoluteSessionInProgress	30 minutes	No description
_hjid	1 year	This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.
_hjIncludedInPageviewSample	2 minutes	No description
_hjTLDTest	session	No description
AnalyticsSyncHistory	1 month	No description
CONSENT	16 years 8 months 26 days 9 hours 2 minutes	No description
UserMatchHistory	1 month	Linkedin - Used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.

Our Insights

Thought Leadership and Industry Trends

How Are You Dealing with eDiscovery of Your Internet Content?

Regina Chepalis

Relativity AI Bootcamp: Atlanta

7th Annual Putting Insights into Practice Forum

Our Blog

Sign Up for Our Newsletter

About CDS

Contact Us