Talk:Howto mirror, spider, or archive a website

Is there any way to make wget not download files larger than a given size when mirroring?
 * Well i parsed through the the man it doesn't look so, sorry. Some possible options.
 * Specify the type of files to download
 * Use regular expressions to ignore files that have some commonality in the name
 * Ignore certain directory's
 * Obviously if this is not applicable, It isn't useful at all. Further more some servers won't even let u see the file size.  So it looks like wget wont help you. unless i missed something.  One thing you could try is writing a script to read the directory, and if the ling has a file size less that what you want, then wget the file, if else ignore it.  If its at dir enter it and so one.  Obviously this i significantly harder and if you do make it you should publish it one wikihowto.  I would serch google and see if there is any solutions out there.  But I don't see how wget can do it, sorry.  ZyMOS