A new Smallpdf study reveals that unmanaged PDFs are creating significant enterprise dark data and storage challenges. The survey found that oversized documents are reducing productivity, increasing storage demands, and leaving document workflows outside standard IT governance, highlighting the need for better document management and oversight.
Enterprise IT teams are investing heavily in AI, automation, cloud storage, and data governance, but everyday document workflows may be creating a quieter operational blind spot. New research from Smallpdf finds that oversized PDFs, duplicated files, and unaudited document archives are adding to dark data, storage waste, and workflow inefficiency across the enterprise.
Smallpdf surveyed 1,001 full-time U.S. employees to understand how workers handle file size, PDF compression, document duplication, and archived files. The findings show that while organizations are modernizing digital operations, many still lack basic controls for one of the most common forms of workplace data: business documents.
According to the study, 78% of employees have personally experienced at least one work-related problem caused by large file sizes. These issues include rejected emails, stalled uploads, documents that could not be shared, disrupted workflows, storage constraints, and missed or delayed deadlines.
Despite the widespread impact, only 27% of organizations report lost productivity from slow uploads or downloads, suggesting a visibility gap between employee experience and enterprise workflow measurement. For IT leaders trying to improve automation, productivity, and data oversight, this creates a hidden operational layer that may not be captured in existing dashboards, policies, or cost-control programs.
“Enterprise teams are focused on AI readiness, automation, and stronger data governance, but unmanaged documents can still sit outside that strategy,” said a Smallpdf spokesperson. “Oversized PDFs and duplicated files may look like small workflow problems, but at scale they become storage, visibility, and governance issues.”
The research found that 79% of employees say their organization has no formal policy for document compression or file-size limits. This includes 62% who said definitively that no such policy exists and 17% who did not know whether one was in place. The absence of clear standards means oversized documents can accumulate across inboxes, cloud drives, shared folders, collaboration platforms, and long-term archives without routine review.
Archived PDFs are a particular concern. Smallpdf found that an estimated 66% of archived PDFs are stored without any file-size review or compression. In addition, 28% of companies rarely or never audit stored documents. This can leave redundant, bloated, or outdated files sitting in storage environments beyond the reach of normal oversight.
The productivity consequences are also measurable. Nearly half of employees, or 45%, said they have had a file rejected by email due to size limits. Almost 2 in 5 professionals, or 38%, said they were unable to share a document at all because of file size, while 37% said they had waited an unusually long time for a file to upload or download.
On average, employees lose 15 minutes per week dealing with file-size issues. That adds up to more than 12 hours per year per employee. Across large teams, those delays can create meaningful productivity losses, especially when employees are already moving between email, cloud storage, document management systems, collaboration tools, and workflow platforms.
Storage costs are another pressure point. Nearly 1 in 5 employees, or 18%, said storage limits disrupted workflows, while 16% said their organization had to purchase additional storage sooner than planned. Nearly 1 in 10, or 8%, said file-size issues delayed or caused them to miss a project deadline.
The study also found that many employees are not checking file sizes before they create new storage and workflow problems. Half of employees, or 50%, said they often or always upload PDFs without checking or reducing file size first. More than 4 in 5 employees, or 81%, said they do not think about file size until it causes a problem.
Document duplication is compounding the issue. More than 1 in 2 professionals, or 56%, said they have duplicated a PDF multiple times instead of compressing or replacing the original. This behavior can create redundant files, version confusion, unnecessary storage use, and weaker information management.
For organizations preparing data environments for AI, these habits may become increasingly important. AI-powered search, classification, summarization, and workflow automation depend on clean, accessible, and well-managed information. Large volumes of duplicated or unmanaged files can make it harder for businesses to maintain control over the data that feeds digital workflows.
Smallpdf says the solution does not require major infrastructure changes. According to Smallpdf internal data, compressing a PDF reduces file size by an average of 54%. More than half of workers, or 51%, said they would compress files more often if the process were easier.
The company recommends that organizations introduce simple document governance practices, including file-size limits, compression rules, regular archive reviews, and clearer policies for replacing outdated documents instead of duplicating them. These practices can help reduce storage waste, improve workflow efficiency, and give IT teams better control over everyday business documents.
“Better document management is one of the lowest-friction ways to improve digital operations,” the Smallpdf spokesperson said. “When teams compress large files, reduce duplication, and review archives regularly, they can cut storage waste and improve the quality of the information their business relies on.”
As enterprises continue investing in AI and cloud-based workflows, Smallpdf says document hygiene should become part of the broader conversation around operational resilience, cost control, and data governance. Oversized PDFs may seem like a routine inconvenience, but the findings suggest they are contributing to a larger enterprise challenge: unmanaged documents becoming another layer of dark data.
Key findings include:
- 78% of employees have experienced at least one work-related problem caused by large file sizes.
- 79% of workers say their company has no formal file compression policy or file-size limit.
- 66% of archived PDFs are stored without file-size review or compression.
- 56% of professionals have duplicated a PDF multiple times instead of compressing or replacing it.
- 50% of employees often or always upload PDFs without checking or reducing file size first.
- 45% have had a file rejected by email because of size limits.
- 38% have been unable to share a document due to file size.
- Employees lose an average of 15 minutes per week dealing with file-size issues.
- 28% of companies rarely or never audit stored documents.
- Compressing a PDF reduces file size by an average of 54%, according to Smallpdf internal data.
For more information, visit: https://smallpdf.com/.
View the report at: https://smallpdf.com/blog/hidden-cost-oversized-pdfs
Methodology
Smallpdf commissioned an online survey of 1,001 full-time U.S. employees. Respondents were screened for full-time employment status and represent a range of industries, generations, and role levels. The sample includes workers across healthcare, information technology, education, accounting and finance, government and public sector, manufacturing, retail and e-commerce, and other sectors.
Generational breakdowns reflect Gen Z at 13%, millennials at 59%, Gen X at 25%, and baby boomers at 4%. Averages for open-ended numeric responses were calculated using the interquartile range method to limit the influence of outliers. Industry-level findings are reported only for segments with at least 50 respondents. Percentages may not sum to 100% due to rounding.
Related News: