<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:admin="http://webns.net/mvcb/"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
	<channel> 

	<title>Comments on: Too many files?</title>
	<link>http://ask.metafilter.com/56152/Too-many-files/</link>
	<description>Comments on Ask MetaFilter post Too many files?</description>
	<pubDate>Thu, 01 Feb 2007 08:22:53 -0800</pubDate>
	<lastBuildDate>Thu, 01 Feb 2007 08:22:53 -0800</lastBuildDate>
	<language>en-us</language>
	<docs>http://blogs.law.harvard.edu/tech/rss</docs>
	<ttl>60</ttl>

	<item>
		<title>Question: Too many files?</title>
		<link>http://ask.metafilter.com/56152/Too-many-files</link>	
		<description>Is it possible for Windows 2000/XP (or more properly, an NTFS disk volume) to &quot;collapse&quot; under the weight of too many files? &lt;br /&gt;&lt;br /&gt; I have about a million specialized data files spread out on about a hundred CDs (no, it&apos;s not porn) and would like to load them onto one of those 500 GB Firewire drives formatted with NTFS.  I will spread them out among multiple directories, at least so that directory operations in various applications won&apos;t grind the OS to a halt.&lt;br&gt;
&lt;br&gt;
But I have some concerns about whether a huge number of files could corrupt the volume somehow or invite read/write problems.  Doesn&apos;t it take a lot of storage merely to index a million files?&lt;br&gt;
&lt;br&gt;
How many files is &quot;too many&quot;?  Is there any advantage to putting massive collections like this into zip files (if so I&apos;d be using command line info-zip)?</description>
		<guid isPermaLink="false">post:ask.metafilter.com,2007:site.56152</guid>
		<pubDate>Thu, 01 Feb 2007 08:12:22 -0800</pubDate>
		<dc:creator>chef_boyardee</dc:creator>
		
			<category>harddrive</category>
		
			<category>files</category>
		
			<category>ntfs</category>
		
			<category>windows</category>
		
			<category>win2k</category>
		
			<category>xp</category>
		
	</item> <item>
		<title>By: ReiToei</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845392</link>	
		<description>You&apos;d need to be pushing serious amounts of data about to &apos;break&apos; NTFS in the sense you&apos;re talking about. See &lt;a href=&quot;http://en.wikipedia.org/wiki/NTFS#Limitations&quot;&gt;here&lt;/a&gt; for the limitations of this file system.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845392</guid>
		<pubDate>Thu, 01 Feb 2007 08:22:53 -0800</pubDate>
		<dc:creator>ReiToei</dc:creator>
	</item><item>
		<title>By: deadmessenger</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845393</link>	
		<description>I currently support a production Windows 2003 server that is hosting close to 40 million small files, spread out over several thousand directories.    I don&apos;t think you&apos;re going to see an issue with the filesystem.   &lt;br&gt;
&lt;br&gt;
One issue that I have seen, however, is when you have a great many files in a single directory.  I had an issue on an Exchange server where the mailroot/vsi 1/badmail directory accumulated over 20 million small files over a period of about a year.  Disk performance was absolutely terrible on that machine as a result.   It also took almost a week just to delete the files in that directory.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845393</guid>
		<pubDate>Thu, 01 Feb 2007 08:23:09 -0800</pubDate>
		<dc:creator>deadmessenger</dc:creator>
	</item><item>
		<title>By: jellicle</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845395</link>	
		<description>Microsoft says you should be able to have 2^32 (4.3 billion) files per NTFS volume, so you should be fine.&lt;br&gt;
&lt;br&gt;
Spread them out in different directories as much as you are able.  Single directories with excessive numbers of files in them will have poor performance.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845395</guid>
		<pubDate>Thu, 01 Feb 2007 08:25:54 -0800</pubDate>
		<dc:creator>jellicle</dc:creator>
	</item><item>
		<title>By: jepler</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845417</link>	
		<description>It&apos;s a Linux system with ext3 filesystem, but I count approximately 1 million files in 125000 directories on my fileserver (it&apos;s a 300GB disk) with no troubles.  I assume other modern filesystems like NTFS are also up to this task.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845417</guid>
		<pubDate>Thu, 01 Feb 2007 08:51:21 -0800</pubDate>
		<dc:creator>jepler</dc:creator>
	</item><item>
		<title>By: Mitheral</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845442</link>	
		<description>One easily encountered problem with NTFS is the total directory path character count is fairly small.  ReiToei&apos;s link says the max filename size is 255 characters but that is actually the maximum path length.  You can have a filename with 255 characters however it can only be mounted at the root. If you have long directory names and a deep tree you can run into problems.  &lt;br&gt;
&lt;br&gt;
255 characters may seem like a lot but take for example your &quot;My Pictures&quot; folder.  On my machine the path (C:\Documents and Settings\Mitheral\My Documents\My Pictures) has already consumed 54 characters and I haven&apos;t even created a sub directory yet.  Office creates a folder in my pictures called &quot;Microsoft Clip Organizer&quot;, whoops there goes another 24 characters. Logitech creates the folders &quot;My Logitech Pictures\Pictures and Videos&quot; when you install one of their camera products.  Store anything in their and your path is 39+54=93 characters long and you haven&apos;t even got started naming your files or organising them into directories.&lt;br&gt;
&lt;br&gt;
But the worse offender is IE.  When you save a web page the default name given to the file and the associated directory is the title text.  If the webmaster has entered a magnum opus as the title it takes the whole thing.  The save dialog doesn&apos;t show the full name and users who use explorer in icon mode won&apos;t ever see the full thing.  Try to move it to a directory with a longer path though and windows throws errors.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845442</guid>
		<pubDate>Thu, 01 Feb 2007 09:08:19 -0800</pubDate>
		<dc:creator>Mitheral</dc:creator>
	</item><item>
		<title>By: damn dirty ape</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845450</link>	
		<description>The biggest issue you&apos;ll probably come across is file fragmentation and performance.  I&apos;d auto-schedule a defrag on a drive holding that many small files.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845450</guid>
		<pubDate>Thu, 01 Feb 2007 09:13:46 -0800</pubDate>
		<dc:creator>damn dirty ape</dc:creator>
	</item><item>
		<title>By: flabdablet</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845620</link>	
		<description>Directory fragmentation is also a real issue with NTFS.  If you&apos;re copying loads of files to an NTFS volume, and they&apos;re going to be essentially static once they get there, use Robocopy to do the copy, and do all the copies twice - once with the switch that makes it copy dummy zero-length files instead of the real ones, and a second time to copy the actual file contents.  Details about switches to use are in the Robocopy documentation, which I don&apos;t have access to right now.&lt;br&gt;
&lt;br&gt;
Or you could use a NAS box instead of a Firewire drive, which would allow you to use the ReiserFS filesystem on your 500GB disk.  ReiserFS has no trouble at all with hundreds of thousands of files in a single directory.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845620</guid>
		<pubDate>Thu, 01 Feb 2007 12:30:04 -0800</pubDate>
		<dc:creator>flabdablet</dc:creator>
	</item><item>
		<title>By: CrayDrygu</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845795</link>	
		<description>&lt;i&gt;&quot;ReiToei&apos;s link says the max filename size is 255 characters but that is actually the maximum path length.&quot;&lt;/i&gt;&lt;br&gt;
&lt;br&gt;
That&apos;s inaccurate.  Improperly written (or old) programs that don&apos;t use the newer APIs for handling filenames will have a 255 character limit on the entire path.  Modern and properly-written programs will not have that limitation.&lt;br&gt;
&lt;br&gt;
Of course, there&apos;s no shortage of older or crappy programs.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845795</guid>
		<pubDate>Thu, 01 Feb 2007 15:54:56 -0800</pubDate>
		<dc:creator>CrayDrygu</dc:creator>
	</item><item>
		<title>By: srboisvert</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845815</link>	
		<description>&lt;em&gt;&quot;ReiToei&apos;s link says the max filename size is 255 characters but that is actually the maximum path length.&quot;&lt;br&gt;
&lt;br&gt;
That&apos;s inaccurate. Improperly written (or old) programs that don&apos;t use the newer APIs for handling filenames will have a 255 character limit on the entire path. Modern and properly-written programs will not have that limitation.&lt;/em&gt;&lt;br&gt;
&lt;br&gt;
I ran into this problem with Windows Explorer and a usb backup drive.  One issue to keep in mind is that data still often travels between file system formats - NTFS to FAT32 or the CD formats.  Long directory structures can make this painful.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845815</guid>
		<pubDate>Thu, 01 Feb 2007 16:19:56 -0800</pubDate>
		<dc:creator>srboisvert</dc:creator>
	</item><item>
		<title>By: flabdablet</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#845982</link>	
		<description>The 255 character pathname limit doesn&apos;t apply to Robocopy, unless you turn on the switch to enable it (?!!) but does apply to most other command-line-based tools.  And it will apply in the same way regardless of what kind of filesystem you end up choosing.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-845982</guid>
		<pubDate>Thu, 01 Feb 2007 19:34:45 -0800</pubDate>
		<dc:creator>flabdablet</dc:creator>
	</item><item>
		<title>By: nataaniinez</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#846185</link>	
		<description>Bottom Line: Yes, your 500GB external drives will handle your million files as long as you span your files across folders. To keep your processor from pegging out, I would put about 125-150,000 files per folder.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-846185</guid>
		<pubDate>Fri, 02 Feb 2007 07:05:53 -0800</pubDate>
		<dc:creator>nataaniinez</dc:creator>
	</item><item>
		<title>By: Mitheral</title>
		<link>http://ask.metafilter.com/56152/Too-many-files#846197</link>	
		<description>It may be a side effect of our local conditions.  We have home drives H:\ on regular server (W2K3) shares whose paths are nice and short but our corporate shared drive Z:\ is using DFS. A side effect of which is the user doesn&apos;t know what the path length on the server is and it is sometimes fairly long.  So when they attempt to move some 130 character saved webpage from their H:\ to Z:\ they can get path length errors because z:\ just happens to have a directory path that is 20 characters longer than H:\ and that bumps them over the 255 character limit.  It could be IS is doing some sort of jiggery pokery with DFS that is causing this, we have quite a bit of legacy cruft and shims floating around.</description>
		<guid isPermaLink="false">comment:ask.metafilter.com,2007:site.56152-846197</guid>
		<pubDate>Fri, 02 Feb 2007 07:14:25 -0800</pubDate>
		<dc:creator>Mitheral</dc:creator>
	</item>
	</channel>
</rss>
