Skip the navigation
)

QuickStudy: Filing It Away

By Todd R. Weiss
September 2, 2002 12:00 PM ET

Computerworld - File systems organize the data stored on computer hard drives, keeping track of the physical locations of all data elements on disk while allowing users to quickly and reliably retrieve files when needed.

The file system acts as a digital index that lets a computer instantly find a specific file, regardless of the size or configuration of the storage drive or where the data bytes associated with the file sit on the drive's storage platters.

Every operating system, from MS-DOS to Windows 95, Windows XP and Linux, has its own file system. But although all file systems perform the same basic functions, they vary in design and sophistication.

FAT

File systems have come a long way since MS-DOS and early versions of Windows. Those operating systems organized files under the FAT file system, which represents logical areas of the disk in allocation units called clusters, and maps the locations of file data to those areas using a file allocation table (FAT). FAT is also called FAT16 because it uses a 16-bit address space for tracking files and clusters.

FAT clusters vary with the size of the disk. FAT's 16-bit address space can support up to 65,536 clusters (216). With a 65MB disk, cluster sizes were just 1KB in size, but they ballooned as disks emerged that were able to hold gigabytes of data. And since only a single file can be written to a cluster, this created inefficiencies that ended up wasting as much as 50% of available space on a 2GB disk drive.

FAT32

FAT32, which debuted with Windows 95 OEM Service Release 2 (OSR2), introduced a 32-bit address space. By increasing the size of the file allocation table, it could support more clusters that were smaller in size on large disk drives, reducing the potential for wasted drive space.

Another FAT32 innovation was that it could handle file names with up to 255 characters, whereas FAT could only handle names with up to eight characters. Users could finally create long file names to better describe the contents.

The advent of FAT32 extended the maximum addressable volume size from 2GB to 2TB and improved reliability by allowing the system to switch to a copy of the file allocation table if the default copy should become damaged. But FAT32 also added to file system overhead and was therefore inefficient to run on disks smaller than 260MB.

NTFS

The next development in Windows file systems was the New Technology File System (NTFS), introduced with Windows NT (which also supported FAT32). With a 64-bit address space and the ability to vary cluster size independently of the disk drive size, NTFS virtually eliminated the cluster size limitation problem.

It also brought other benefits, including file and directory security attributes, file encryption and support for storage volumes of up to 16TB and 232 clusters.

NTFS replaced the familiar file allocation table format with the Master File Table (MFT), which holds more information about files than did FAT. The MFT references all files and directories on the disk drive, including associated metadata such as security settings.

The NTFS also introduced a high level of fault tolerance. It logs disk operation activity prior to committing the transaction. If the system crashes during an update, it can examine the log file and restore the data. When read or write errors occur during normal operation, NTFS automatically identifies and blocks out the bad clusters and copies the data to a new location. Finally, NTFS creates a mirror of the MFT and can revert to the mirror should the original fail.

NTFS's overhead makes it unsuitable for disks smaller than 400MB, and it can't be used on floppy disks. Instead, Windows must write to formatted diskettes using FAT32.

Ext2

The Linux file system, called Extended File System 2 (Ext2), evolved to rectify limitations of Linux's original file system, Ext, which the operating system inherited from its Minix predecessor. Under the Minix file system, the maximum file system size was restricted to 64MB and file names to 14 characters.

Ext supported 2GB file systems and 255-character file names but suffered from some performance limitations. Ext2 supports 4TB file systems and 255 character file names and remedies those problems.

The Ext2 architecture uses a data structure called identification nodes (inodes) to refer to and locate files and associated data. The inode table includes the file type, size, access rights, pointers to associated data blocks and other attributes. The file system organizes disk space into groups of blocks, which contain both inode information and associated data blocks.

The Linux kernel uses the Virtual File System layer, which interacts with the file system to perform disk I/O. This gives Linux the ability to support multiple file systems, including DOS, FAT16 and FAT32 (which it supports as a native file system).

FILE SYSTEM SUPPORT BY OPERATING SYSTEM

OPERATING SYSTEM FILE SYSTEMS SUPPORTED
MS-DOS, Windows 95 FAT16
Windows 95 OSR2, 98, Me FAT16, FAT32
Windows NT, 2000, XP NTFS, FAT16, FAT32
Linux Ext2, FAT32, FAT16

File System Specs

FILE SYSTEM MAXIMUM FILE NAME LENGTH (CHARACTERS) MAXIMUM VOLUME SIZE MAXIMUM FILE SIZE
FAT16 8 2GB* 2GB
FAT32 255 2TB 4GB
NTFS 255 16TB 16TB
Ext2 255 4TB 2GB

*4GB under Windows NT


See additional Computerworld QuickStudies

Read more about Operating Systems in Computerworld's Operating Systems Topic Center.



What is Tech Briefcase?
TechBriefcase is a new, free service where IT Professionals can Search, Store and Share IT white papers and content like this. Learn more
Bookmark content
Speed up your research efforts with content across the web.
Search and Store
Find the white papers you need. Create folders for any topic.
View Anywhere
Open your briefcase on your iPhone, tablet or desktop. Share with colleagues.
Don't have an account yet?
Additional Resources
Security KnowledgeVault
WHITE PAPER
Security is not an option. This KnowledgeVault Series offers professional advice how to be proactive in the fight against cybercrimes and multi-layered security threats; how to adopt a holistic approach to protecting and managing data; and how to hire a qualified security assessor. Make security your Number 1 priority.

Read now.

Cut Communications Costs Once and for All
WHITE PAPER
New IP-based communications systems are being deployed by small and midsized businesses at a rapid rate. Learn how these organizations are enabling faster responsiveness, creating better customer experiences, speeding office or mobile interactions, and dramatically reducing existing communications costs.

Read now.

Operating Systems White Papers
Ready Your Enterprise for the Next Generation of Client Computing
Data is now accessed via physical and virtual desktops, laptops and mobile devices. IT organizations struggle to control and manage the widening pool...
Consolidating Lotus Domino x86 Workloads on IBM Power Systems
Read the white paper to learn how moving up to Lotus Domino 8.5 and consolidating with IBM Power Servers can help you boost...
A Comparison of PowerVM and VMware vSphere (4.1 & 5.0) Virtualization Performance
This technical white paper presents benchmark results showing greater VM consolidation ratios than demonstrated in previous benchmarks and demonstrating the extent of the...
How Nimsoft Service Desk Speeds Deployment and Time to Value
For years, many support teams have been hamstrung by their traditional service desk platforms, which require complex, time-consuming coding for virtually every aspect...
Practice Management: Double Billing Rate and Improve Patient Services
Would you like to double your billing rate and achieve faster payment for services?

Download this customer success story to see how One Health...
All Operating Systems White Papers
Operating Systems Webcasts
Best Practices in Monitoring VMware
The benefits of virtualization are unassailable: increased agility, scale, and cost savings to name a few. However, so too are the monitoring challenges...
Distributed Database Security with Real-time Monitoring
View this demo and learn how IBM InfoSphere Guardium database activity monitoring can help protect your sensitive data in distributed DBMS environments with...
InfoSphere Warehouse Packs Demo
These flash modules make warehousing more tangible and relevant to business users through detailed explanations of the InfoSphere Warehouse Packs.
Delivery Management -- Extending Lifecycle Management
Date: Wednesday, June 20, 2012, 1:00 PM EDT

Siloed organizations continue doing the wrong things and doing things wrong, leading to increased costs,...
Leverage automation today to reduce IT complexity
Date: Tuesday, June 5, 2012, 2:00 PM EDT

Whether your B2B complexity is caused by multiple technologies due to M&A, business or application specific...
All Operating Systems Webcasts
Newsletter Sign-Up

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all newsletters | Privacy Policy
IT Jobs