123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186 |
- ROMFS - ROM FILE SYSTEM
- This is a quite dumb, read only filesystem, mainly for initial RAM
- disks of installation disks. It has grown up by the need of having
- modules linked at boot time. Using this filesystem, you get a very
- similar feature, and even the possibility of a small kernel, with a
- file system which doesn't take up useful memory from the router
- functions in the basement of your office.
- For comparison, both the older minix and xiafs (the latter is now
- defunct) filesystems, compiled as module need more than 20000 bytes,
- while romfs is less than a page, about 4000 bytes (assuming i586
- code). Under the same conditions, the msdos filesystem would need
- about 30K (and does not support device nodes or symlinks), while the
- nfs module with nfsroot is about 57K. Furthermore, as a bit unfair
- comparison, an actual rescue disk used up 3202 blocks with ext2, while
- with romfs, it needed 3079 blocks.
- To create such a file system, you'll need a user program named
- genromfs. It is available on http://romfs.sourceforge.net/
- As the name suggests, romfs could be also used (space-efficiently) on
- various read-only media, like (E)EPROM disks if someone will have the
- motivation.. :)
- However, the main purpose of romfs is to have a very small kernel,
- which has only this filesystem linked in, and then can load any module
- later, with the current module utilities. It can also be used to run
- some program to decide if you need SCSI devices, and even IDE or
- floppy drives can be loaded later if you use the "initrd"--initial
- RAM disk--feature of the kernel. This would not be really news
- flash, but with romfs, you can even spare off your ext2 or minix or
- maybe even affs filesystem until you really know that you need it.
- For example, a distribution boot disk can contain only the cd disk
- drivers (and possibly the SCSI drivers), and the ISO 9660 filesystem
- module. The kernel can be small enough, since it doesn't have other
- filesystems, like the quite large ext2fs module, which can then be
- loaded off the CD at a later stage of the installation. Another use
- would be for a recovery disk, when you are reinstalling a workstation
- from the network, and you will have all the tools/modules available
- from a nearby server, so you don't want to carry two disks for this
- purpose, just because it won't fit into ext2.
- romfs operates on block devices as you can expect, and the underlying
- structure is very simple. Every accessible structure begins on 16
- byte boundaries for fast access. The minimum space a file will take
- is 32 bytes (this is an empty file, with a less than 16 character
- name). The maximum overhead for any non-empty file is the header, and
- the 16 byte padding for the name and the contents, also 16+14+15 = 45
- bytes. This is quite rare however, since most file names are longer
- than 3 bytes, and shorter than 15 bytes.
- The layout of the filesystem is the following:
- offset content
- +---+---+---+---+
- 0 | - | r | o | m | \
- +---+---+---+---+ The ASCII representation of those bytes
- 4 | 1 | f | s | - | / (i.e. "-rom1fs-")
- +---+---+---+---+
- 8 | full size | The number of accessible bytes in this fs.
- +---+---+---+---+
- 12 | checksum | The checksum of the FIRST 512 BYTES.
- +---+---+---+---+
- 16 | volume name | The zero terminated name of the volume,
- : : padded to 16 byte boundary.
- +---+---+---+---+
- xx | file |
- : headers :
- Every multi byte value (32 bit words, I'll use the longwords term from
- now on) must be in big endian order.
- The first eight bytes identify the filesystem, even for the casual
- inspector. After that, in the 3rd longword, it contains the number of
- bytes accessible from the start of this filesystem. The 4th longword
- is the checksum of the first 512 bytes (or the number of bytes
- accessible, whichever is smaller). The applied algorithm is the same
- as in the AFFS filesystem, namely a simple sum of the longwords
- (assuming bigendian quantities again). For details, please consult
- the source. This algorithm was chosen because although it's not quite
- reliable, it does not require any tables, and it is very simple.
- The following bytes are now part of the file system; each file header
- must begin on a 16 byte boundary.
- offset content
- +---+---+---+---+
- 0 | next filehdr|X| The offset of the next file header
- +---+---+---+---+ (zero if no more files)
- 4 | spec.info | Info for directories/hard links/devices
- +---+---+---+---+
- 8 | size | The size of this file in bytes
- +---+---+---+---+
- 12 | checksum | Covering the meta data, including the file
- +---+---+---+---+ name, and padding
- 16 | file name | The zero terminated name of the file,
- : : padded to 16 byte boundary
- +---+---+---+---+
- xx | file data |
- : :
- Since the file headers begin always at a 16 byte boundary, the lowest
- 4 bits would be always zero in the next filehdr pointer. These four
- bits are used for the mode information. Bits 0..2 specify the type of
- the file; while bit 4 shows if the file is executable or not. The
- permissions are assumed to be world readable, if this bit is not set,
- and world executable if it is; except the character and block devices,
- they are never accessible for other than owner. The owner of every
- file is user and group 0, this should never be a problem for the
- intended use. The mapping of the 8 possible values to file types is
- the following:
- mapping spec.info means
- 0 hard link link destination [file header]
- 1 directory first file's header
- 2 regular file unused, must be zero [MBZ]
- 3 symbolic link unused, MBZ (file data is the link content)
- 4 block device 16/16 bits major/minor number
- 5 char device - " -
- 6 socket unused, MBZ
- 7 fifo unused, MBZ
- Note that hard links are specifically marked in this filesystem, but
- they will behave as you can expect (i.e. share the inode number).
- Note also that it is your responsibility to not create hard link
- loops, and creating all the . and .. links for directories. This is
- normally done correctly by the genromfs program. Please refrain from
- using the executable bits for special purposes on the socket and fifo
- special files, they may have other uses in the future. Additionally,
- please remember that only regular files, and symlinks are supposed to
- have a nonzero size field; they contain the number of bytes available
- directly after the (padded) file name.
- Another thing to note is that romfs works on file headers and data
- aligned to 16 byte boundaries, but most hardware devices and the block
- device drivers are unable to cope with smaller than block-sized data.
- To overcome this limitation, the whole size of the file system must be
- padded to an 1024 byte boundary.
- If you have any problems or suggestions concerning this file system,
- please contact me. However, think twice before wanting me to add
- features and code, because the primary and most important advantage of
- this file system is the small code. On the other hand, don't be
- alarmed, I'm not getting that much romfs related mail. Now I can
- understand why Avery wrote poems in the ARCnet docs to get some more
- feedback. :)
- romfs has also a mailing list, and to date, it hasn't received any
- traffic, so you are welcome to join it to discuss your ideas. :)
- It's run by ezmlm, so you can subscribe to it by sending a message
- to [email protected], the content is irrelevant.
- Pending issues:
- - Permissions and owner information are pretty essential features of a
- Un*x like system, but romfs does not provide the full possibilities.
- I have never found this limiting, but others might.
- - The file system is read only, so it can be very small, but in case
- one would want to write _anything_ to a file system, he still needs
- a writable file system, thus negating the size advantages. Possible
- solutions: implement write access as a compile-time option, or a new,
- similarly small writable filesystem for RAM disks.
- - Since the files are only required to have alignment on a 16 byte
- boundary, it is currently possibly suboptimal to read or execute files
- from the filesystem. It might be resolved by reordering file data to
- have most of it (i.e. except the start and the end) laying at "natural"
- boundaries, thus it would be possible to directly map a big portion of
- the file contents to the mm subsystem.
- - Compression might be an useful feature, but memory is quite a
- limiting factor in my eyes.
- - Where it is used?
- - Does it work on other architectures than intel and motorola?
- Have fun,
- Janos Farkas <[email protected]>
|