mirror of
https://github.com/torvalds/linux.git
synced 2024-11-17 17:41:44 +00:00
1674 lines
49 KiB
Plaintext
1674 lines
49 KiB
Plaintext
|
NOTE:
|
|||
|
This is one of the technical documents describing a component of
|
|||
|
Coda -- this document describes the client kernel-Venus interface.
|
|||
|
|
|||
|
For more information:
|
|||
|
http://www.coda.cs.cmu.edu
|
|||
|
For user level software needed to run Coda:
|
|||
|
ftp://ftp.coda.cs.cmu.edu
|
|||
|
|
|||
|
To run Coda you need to get a user level cache manager for the client,
|
|||
|
named Venus, as well as tools to manipulate ACLs, to log in, etc. The
|
|||
|
client needs to have the Coda filesystem selected in the kernel
|
|||
|
configuration.
|
|||
|
|
|||
|
The server needs a user level server and at present does not depend on
|
|||
|
kernel support.
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
The Venus kernel interface
|
|||
|
Peter J. Braam
|
|||
|
v1.0, Nov 9, 1997
|
|||
|
|
|||
|
This document describes the communication between Venus and kernel
|
|||
|
level filesystem code needed for the operation of the Coda file sys-
|
|||
|
tem. This document version is meant to describe the current interface
|
|||
|
(version 1.0) as well as improvements we envisage.
|
|||
|
______________________________________________________________________
|
|||
|
|
|||
|
Table of Contents
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
1. Introduction
|
|||
|
|
|||
|
2. Servicing Coda filesystem calls
|
|||
|
|
|||
|
3. The message layer
|
|||
|
|
|||
|
3.1 Implementation details
|
|||
|
|
|||
|
4. The interface at the call level
|
|||
|
|
|||
|
4.1 Data structures shared by the kernel and Venus
|
|||
|
4.2 The pioctl interface
|
|||
|
4.3 root
|
|||
|
4.4 lookup
|
|||
|
4.5 getattr
|
|||
|
4.6 setattr
|
|||
|
4.7 access
|
|||
|
4.8 create
|
|||
|
4.9 mkdir
|
|||
|
4.10 link
|
|||
|
4.11 symlink
|
|||
|
4.12 remove
|
|||
|
4.13 rmdir
|
|||
|
4.14 readlink
|
|||
|
4.15 open
|
|||
|
4.16 close
|
|||
|
4.17 ioctl
|
|||
|
4.18 rename
|
|||
|
4.19 readdir
|
|||
|
4.20 vget
|
|||
|
4.21 fsync
|
|||
|
4.22 inactive
|
|||
|
4.23 rdwr
|
|||
|
4.24 odymount
|
|||
|
4.25 ody_lookup
|
|||
|
4.26 ody_expand
|
|||
|
4.27 prefetch
|
|||
|
4.28 signal
|
|||
|
|
|||
|
5. The minicache and downcalls
|
|||
|
|
|||
|
5.1 INVALIDATE
|
|||
|
5.2 FLUSH
|
|||
|
5.3 PURGEUSER
|
|||
|
5.4 ZAPFILE
|
|||
|
5.5 ZAPDIR
|
|||
|
5.6 ZAPVNODE
|
|||
|
5.7 PURGEFID
|
|||
|
5.8 REPLACE
|
|||
|
|
|||
|
6. Initialization and cleanup
|
|||
|
|
|||
|
6.1 Requirements
|
|||
|
|
|||
|
|
|||
|
______________________________________________________________________
|
|||
|
0wpage
|
|||
|
|
|||
|
11.. IInnttrroodduuccttiioonn
|
|||
|
|
|||
|
|
|||
|
|
|||
|
A key component in the Coda Distributed File System is the cache
|
|||
|
manager, _V_e_n_u_s.
|
|||
|
|
|||
|
|
|||
|
When processes on a Coda enabled system access files in the Coda
|
|||
|
filesystem, requests are directed at the filesystem layer in the
|
|||
|
operating system. The operating system will communicate with Venus to
|
|||
|
service the request for the process. Venus manages a persistent
|
|||
|
client cache and makes remote procedure calls to Coda file servers and
|
|||
|
related servers (such as authentication servers) to service these
|
|||
|
requests it receives from the operating system. When Venus has
|
|||
|
serviced a request it replies to the operating system with appropriate
|
|||
|
return codes, and other data related to the request. Optionally the
|
|||
|
kernel support for Coda may maintain a minicache of recently processed
|
|||
|
requests to limit the number of interactions with Venus. Venus
|
|||
|
possesses the facility to inform the kernel when elements from its
|
|||
|
minicache are no longer valid.
|
|||
|
|
|||
|
This document describes precisely this communication between the
|
|||
|
kernel and Venus. The definitions of so called upcalls and downcalls
|
|||
|
will be given with the format of the data they handle. We shall also
|
|||
|
describe the semantic invariants resulting from the calls.
|
|||
|
|
|||
|
Historically Coda was implemented in a BSD file system in Mach 2.6.
|
|||
|
The interface between the kernel and Venus is very similar to the BSD
|
|||
|
VFS interface. Similar functionality is provided, and the format of
|
|||
|
the parameters and returned data is very similar to the BSD VFS. This
|
|||
|
leads to an almost natural environment for implementing a kernel-level
|
|||
|
filesystem driver for Coda in a BSD system. However, other operating
|
|||
|
systems such as Linux and Windows 95 and NT have virtual filesystem
|
|||
|
with different interfaces.
|
|||
|
|
|||
|
To implement Coda on these systems some reverse engineering of the
|
|||
|
Venus/Kernel protocol is necessary. Also it came to light that other
|
|||
|
systems could profit significantly from certain small optimizations
|
|||
|
and modifications to the protocol. To facilitate this work as well as
|
|||
|
to make future ports easier, communication between Venus and the
|
|||
|
kernel should be documented in great detail. This is the aim of this
|
|||
|
document.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
22.. SSeerrvviicciinngg CCooddaa ffiilleessyysstteemm ccaallllss
|
|||
|
|
|||
|
The service of a request for a Coda file system service originates in
|
|||
|
a process PP which accessing a Coda file. It makes a system call which
|
|||
|
traps to the OS kernel. Examples of such calls trapping to the kernel
|
|||
|
are _r_e_a_d_, _w_r_i_t_e_, _o_p_e_n_, _c_l_o_s_e_, _c_r_e_a_t_e_, _m_k_d_i_r_, _r_m_d_i_r_, _c_h_m_o_d in a Unix
|
|||
|
context. Similar calls exist in the Win32 environment, and are named
|
|||
|
_C_r_e_a_t_e_F_i_l_e_, .
|
|||
|
|
|||
|
Generally the operating system handles the request in a virtual
|
|||
|
filesystem (VFS) layer, which is named I/O Manager in NT and IFS
|
|||
|
manager in Windows 95. The VFS is responsible for partial processing
|
|||
|
of the request and for locating the specific filesystem(s) which will
|
|||
|
service parts of the request. Usually the information in the path
|
|||
|
assists in locating the correct FS drivers. Sometimes after extensive
|
|||
|
pre-processing, the VFS starts invoking exported routines in the FS
|
|||
|
driver. This is the point where the FS specific processing of the
|
|||
|
request starts, and here the Coda specific kernel code comes into
|
|||
|
play.
|
|||
|
|
|||
|
The FS layer for Coda must expose and implement several interfaces.
|
|||
|
First and foremost the VFS must be able to make all necessary calls to
|
|||
|
the Coda FS layer, so the Coda FS driver must expose the VFS interface
|
|||
|
as applicable in the operating system. These differ very significantly
|
|||
|
among operating systems, but share features such as facilities to
|
|||
|
read/write and create and remove objects. The Coda FS layer services
|
|||
|
such VFS requests by invoking one or more well defined services
|
|||
|
offered by the cache manager Venus. When the replies from Venus have
|
|||
|
come back to the FS driver, servicing of the VFS call continues and
|
|||
|
finishes with a reply to the kernel's VFS. Finally the VFS layer
|
|||
|
returns to the process.
|
|||
|
|
|||
|
As a result of this design a basic interface exposed by the FS driver
|
|||
|
must allow Venus to manage message traffic. In particular Venus must
|
|||
|
be able to retrieve and place messages and to be notified of the
|
|||
|
arrival of a new message. The notification must be through a mechanism
|
|||
|
which does not block Venus since Venus must attend to other tasks even
|
|||
|
when no messages are waiting or being processed.
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
Interfaces of the Coda FS Driver
|
|||
|
|
|||
|
Furthermore the FS layer provides for a special path of communication
|
|||
|
between a user process and Venus, called the pioctl interface. The
|
|||
|
pioctl interface is used for Coda specific services, such as
|
|||
|
requesting detailed information about the persistent cache managed by
|
|||
|
Venus. Here the involvement of the kernel is minimal. It identifies
|
|||
|
the calling process and passes the information on to Venus. When
|
|||
|
Venus replies the response is passed back to the caller in unmodified
|
|||
|
form.
|
|||
|
|
|||
|
Finally Venus allows the kernel FS driver to cache the results from
|
|||
|
certain services. This is done to avoid excessive context switches
|
|||
|
and results in an efficient system. However, Venus may acquire
|
|||
|
information, for example from the network which implies that cached
|
|||
|
information must be flushed or replaced. Venus then makes a downcall
|
|||
|
to the Coda FS layer to request flushes or updates in the cache. The
|
|||
|
kernel FS driver handles such requests synchronously.
|
|||
|
|
|||
|
Among these interfaces the VFS interface and the facility to place,
|
|||
|
receive and be notified of messages are platform specific. We will
|
|||
|
not go into the calls exported to the VFS layer but we will state the
|
|||
|
requirements of the message exchange mechanism.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
33.. TThhee mmeessssaaggee llaayyeerr
|
|||
|
|
|||
|
|
|||
|
|
|||
|
At the lowest level the communication between Venus and the FS driver
|
|||
|
proceeds through messages. The synchronization between processes
|
|||
|
requesting Coda file service and Venus relies on blocking and waking
|
|||
|
up processes. The Coda FS driver processes VFS- and pioctl-requests
|
|||
|
on behalf of a process P, creates messages for Venus, awaits replies
|
|||
|
and finally returns to the caller. The implementation of the exchange
|
|||
|
of messages is platform specific, but the semantics have (so far)
|
|||
|
appeared to be generally applicable. Data buffers are created by the
|
|||
|
FS Driver in kernel memory on behalf of P and copied to user memory in
|
|||
|
Venus.
|
|||
|
|
|||
|
The FS Driver while servicing P makes upcalls to Venus. Such an
|
|||
|
upcall is dispatched to Venus by creating a message structure. The
|
|||
|
structure contains the identification of P, the message sequence
|
|||
|
number, the size of the request and a pointer to the data in kernel
|
|||
|
memory for the request. Since the data buffer is re-used to hold the
|
|||
|
reply from Venus, there is a field for the size of the reply. A flags
|
|||
|
field is used in the message to precisely record the status of the
|
|||
|
message. Additional platform dependent structures involve pointers to
|
|||
|
determine the position of the message on queues and pointers to
|
|||
|
synchronization objects. In the upcall routine the message structure
|
|||
|
is filled in, flags are set to 0, and it is placed on the _p_e_n_d_i_n_g
|
|||
|
queue. The routine calling upcall is responsible for allocating the
|
|||
|
data buffer; its structure will be described in the next section.
|
|||
|
|
|||
|
A facility must exist to notify Venus that the message has been
|
|||
|
created, and implemented using available synchronization objects in
|
|||
|
the OS. This notification is done in the upcall context of the process
|
|||
|
P. When the message is on the pending queue, process P cannot proceed
|
|||
|
in upcall. The (kernel mode) processing of P in the filesystem
|
|||
|
request routine must be suspended until Venus has replied. Therefore
|
|||
|
the calling thread in P is blocked in upcall. A pointer in the
|
|||
|
message structure will locate the synchronization object on which P is
|
|||
|
sleeping.
|
|||
|
|
|||
|
Venus detects the notification that a message has arrived, and the FS
|
|||
|
driver allow Venus to retrieve the message with a getmsg_from_kernel
|
|||
|
call. This action finishes in the kernel by putting the message on the
|
|||
|
queue of processing messages and setting flags to READ. Venus is
|
|||
|
passed the contents of the data buffer. The getmsg_from_kernel call
|
|||
|
now returns and Venus processes the request.
|
|||
|
|
|||
|
At some later point the FS driver receives a message from Venus,
|
|||
|
namely when Venus calls sendmsg_to_kernel. At this moment the Coda FS
|
|||
|
driver looks at the contents of the message and decides if:
|
|||
|
|
|||
|
|
|||
|
+o the message is a reply for a suspended thread P. If so it removes
|
|||
|
the message from the processing queue and marks the message as
|
|||
|
WRITTEN. Finally, the FS driver unblocks P (still in the kernel
|
|||
|
mode context of Venus) and the sendmsg_to_kernel call returns to
|
|||
|
Venus. The process P will be scheduled at some point and continues
|
|||
|
processing its upcall with the data buffer replaced with the reply
|
|||
|
from Venus.
|
|||
|
|
|||
|
+o The message is a _d_o_w_n_c_a_l_l. A downcall is a request from Venus to
|
|||
|
the FS Driver. The FS driver processes the request immediately
|
|||
|
(usually a cache eviction or replacement) and when it finishes
|
|||
|
sendmsg_to_kernel returns.
|
|||
|
|
|||
|
Now P awakes and continues processing upcall. There are some
|
|||
|
subtleties to take account of. First P will determine if it was woken
|
|||
|
up in upcall by a signal from some other source (for example an
|
|||
|
attempt to terminate P) or as is normally the case by Venus in its
|
|||
|
sendmsg_to_kernel call. In the normal case, the upcall routine will
|
|||
|
deallocate the message structure and return. The FS routine can proceed
|
|||
|
with its processing.
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
Sleeping and IPC arrangements
|
|||
|
|
|||
|
In case P is woken up by a signal and not by Venus, it will first look
|
|||
|
at the flags field. If the message is not yet READ, the process P can
|
|||
|
handle its signal without notifying Venus. If Venus has READ, and
|
|||
|
the request should not be processed, P can send Venus a signal message
|
|||
|
to indicate that it should disregard the previous message. Such
|
|||
|
signals are put in the queue at the head, and read first by Venus. If
|
|||
|
the message is already marked as WRITTEN it is too late to stop the
|
|||
|
processing. The VFS routine will now continue. (-- If a VFS request
|
|||
|
involves more than one upcall, this can lead to complicated state, an
|
|||
|
extra field "handle_signals" could be added in the message structure
|
|||
|
to indicate points of no return have been passed.--)
|
|||
|
|
|||
|
|
|||
|
|
|||
|
33..11.. IImmpplleemmeennttaattiioonn ddeettaaiillss
|
|||
|
|
|||
|
The Unix implementation of this mechanism has been through the
|
|||
|
implementation of a character device associated with Coda. Venus
|
|||
|
retrieves messages by doing a read on the device, replies are sent
|
|||
|
with a write and notification is through the select system call on the
|
|||
|
file descriptor for the device. The process P is kept waiting on an
|
|||
|
interruptible wait queue object.
|
|||
|
|
|||
|
In Windows NT and the DPMI Windows 95 implementation a DeviceIoControl
|
|||
|
call is used. The DeviceIoControl call is designed to copy buffers
|
|||
|
from user memory to kernel memory with OPCODES. The sendmsg_to_kernel
|
|||
|
is issued as a synchronous call, while the getmsg_from_kernel call is
|
|||
|
asynchronous. Windows EventObjects are used for notification of
|
|||
|
message arrival. The process P is kept waiting on a KernelEvent
|
|||
|
object in NT and a semaphore in Windows 95.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44.. TThhee iinntteerrffaaccee aatt tthhee ccaallll lleevveell
|
|||
|
|
|||
|
|
|||
|
This section describes the upcalls a Coda FS driver can make to Venus.
|
|||
|
Each of these upcalls make use of two structures: inputArgs and
|
|||
|
outputArgs. In pseudo BNF form the structures take the following
|
|||
|
form:
|
|||
|
|
|||
|
|
|||
|
struct inputArgs {
|
|||
|
u_long opcode;
|
|||
|
u_long unique; /* Keep multiple outstanding msgs distinct */
|
|||
|
u_short pid; /* Common to all */
|
|||
|
u_short pgid; /* Common to all */
|
|||
|
struct CodaCred cred; /* Common to all */
|
|||
|
|
|||
|
<union "in" of call dependent parts of inputArgs>
|
|||
|
};
|
|||
|
|
|||
|
struct outputArgs {
|
|||
|
u_long opcode;
|
|||
|
u_long unique; /* Keep multiple outstanding msgs distinct */
|
|||
|
u_long result;
|
|||
|
|
|||
|
<union "out" of call dependent parts of inputArgs>
|
|||
|
};
|
|||
|
|
|||
|
|
|||
|
|
|||
|
Before going on let us elucidate the role of the various fields. The
|
|||
|
inputArgs start with the opcode which defines the type of service
|
|||
|
requested from Venus. There are approximately 30 upcalls at present
|
|||
|
which we will discuss. The unique field labels the inputArg with a
|
|||
|
unique number which will identify the message uniquely. A process and
|
|||
|
process group id are passed. Finally the credentials of the caller
|
|||
|
are included.
|
|||
|
|
|||
|
Before delving into the specific calls we need to discuss a variety of
|
|||
|
data structures shared by the kernel and Venus.
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
44..11.. DDaattaa ssttrruuccttuurreess sshhaarreedd bbyy tthhee kkeerrnneell aanndd VVeennuuss
|
|||
|
|
|||
|
|
|||
|
The CodaCred structure defines a variety of user and group ids as
|
|||
|
they are set for the calling process. The vuid_t and guid_t are 32 bit
|
|||
|
unsigned integers. It also defines group membership in an array. On
|
|||
|
Unix the CodaCred has proven sufficient to implement good security
|
|||
|
semantics for Coda but the structure may have to undergo modification
|
|||
|
for the Windows environment when these mature.
|
|||
|
|
|||
|
struct CodaCred {
|
|||
|
vuid_t cr_uid, cr_euid, cr_suid, cr_fsuid; /* Real, effective, set, fs uid*/
|
|||
|
vgid_t cr_gid, cr_egid, cr_sgid, cr_fsgid; /* same for groups */
|
|||
|
vgid_t cr_groups[NGROUPS]; /* Group membership for caller */
|
|||
|
};
|
|||
|
|
|||
|
|
|||
|
|
|||
|
NNOOTTEE It is questionable if we need CodaCreds in Venus. Finally Venus
|
|||
|
doesn't know about groups, although it does create files with the
|
|||
|
default uid/gid. Perhaps the list of group membership is superfluous.
|
|||
|
|
|||
|
|
|||
|
The next item is the fundamental identifier used to identify Coda
|
|||
|
files, the ViceFid. A fid of a file uniquely defines a file or
|
|||
|
directory in the Coda filesystem within a _c_e_l_l. (-- A _c_e_l_l is a
|
|||
|
group of Coda servers acting under the aegis of a single system
|
|||
|
control machine or SCM. See the Coda Administration manual for a
|
|||
|
detailed description of the role of the SCM.--)
|
|||
|
|
|||
|
|
|||
|
typedef struct ViceFid {
|
|||
|
VolumeId Volume;
|
|||
|
VnodeId Vnode;
|
|||
|
Unique_t Unique;
|
|||
|
} ViceFid;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
Each of the constituent fields: VolumeId, VnodeId and Unique_t are
|
|||
|
unsigned 32 bit integers. We envisage that a further field will need
|
|||
|
to be prefixed to identify the Coda cell; this will probably take the
|
|||
|
form of a Ipv6 size IP address naming the Coda cell through DNS.
|
|||
|
|
|||
|
The next important structure shared between Venus and the kernel is
|
|||
|
the attributes of the file. The following structure is used to
|
|||
|
exchange information. It has room for future extensions such as
|
|||
|
support for device files (currently not present in Coda).
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
struct coda_vattr {
|
|||
|
enum coda_vtype va_type; /* vnode type (for create) */
|
|||
|
u_short va_mode; /* files access mode and type */
|
|||
|
short va_nlink; /* number of references to file */
|
|||
|
vuid_t va_uid; /* owner user id */
|
|||
|
vgid_t va_gid; /* owner group id */
|
|||
|
long va_fsid; /* file system id (dev for now) */
|
|||
|
long va_fileid; /* file id */
|
|||
|
u_quad_t va_size; /* file size in bytes */
|
|||
|
long va_blocksize; /* blocksize preferred for i/o */
|
|||
|
struct timespec va_atime; /* time of last access */
|
|||
|
struct timespec va_mtime; /* time of last modification */
|
|||
|
struct timespec va_ctime; /* time file changed */
|
|||
|
u_long va_gen; /* generation number of file */
|
|||
|
u_long va_flags; /* flags defined for file */
|
|||
|
dev_t va_rdev; /* device special file represents */
|
|||
|
u_quad_t va_bytes; /* bytes of disk space held by file */
|
|||
|
u_quad_t va_filerev; /* file modification number */
|
|||
|
u_int va_vaflags; /* operations flags, see below */
|
|||
|
long va_spare; /* remain quad aligned */
|
|||
|
};
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
44..22.. TThhee ppiiooccttll iinntteerrffaaccee
|
|||
|
|
|||
|
|
|||
|
Coda specific requests can be made by application through the pioctl
|
|||
|
interface. The pioctl is implemented as an ordinary ioctl on a
|
|||
|
fictitious file /coda/.CONTROL. The pioctl call opens this file, gets
|
|||
|
a file handle and makes the ioctl call. Finally it closes the file.
|
|||
|
|
|||
|
The kernel involvement in this is limited to providing the facility to
|
|||
|
open and close and pass the ioctl message _a_n_d to verify that a path in
|
|||
|
the pioctl data buffers is a file in a Coda filesystem.
|
|||
|
|
|||
|
The kernel is handed a data packet of the form:
|
|||
|
|
|||
|
struct {
|
|||
|
const char *path;
|
|||
|
struct ViceIoctl vidata;
|
|||
|
int follow;
|
|||
|
} data;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
where
|
|||
|
|
|||
|
|
|||
|
struct ViceIoctl {
|
|||
|
caddr_t in, out; /* Data to be transferred in, or out */
|
|||
|
short in_size; /* Size of input buffer <= 2K */
|
|||
|
short out_size; /* Maximum size of output buffer, <= 2K */
|
|||
|
};
|
|||
|
|
|||
|
|
|||
|
|
|||
|
The path must be a Coda file, otherwise the ioctl upcall will not be
|
|||
|
made.
|
|||
|
|
|||
|
NNOOTTEE The data structures and code are a mess. We need to clean this
|
|||
|
up.
|
|||
|
|
|||
|
We now proceed to document the individual calls:
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..33.. rroooott
|
|||
|
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn empty
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_root_out {
|
|||
|
ViceFid VFid;
|
|||
|
} cfs_root;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This call is made to Venus during the initialization of
|
|||
|
the Coda filesystem. If the result is zero, the cfs_root structure
|
|||
|
contains the ViceFid of the root of the Coda filesystem. If a non-zero
|
|||
|
result is generated, its value is a platform dependent error code
|
|||
|
indicating the difficulty Venus encountered in locating the root of
|
|||
|
the Coda filesystem.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..44.. llooookkuupp
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Find the ViceFid and type of an object in a directory if it
|
|||
|
exists.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_lookup_in {
|
|||
|
ViceFid VFid;
|
|||
|
char *name; /* Place holder for data. */
|
|||
|
} cfs_lookup;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_lookup_out {
|
|||
|
ViceFid VFid;
|
|||
|
int vtype;
|
|||
|
} cfs_lookup;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This call is made to determine the ViceFid and filetype of
|
|||
|
a directory entry. The directory entry requested carries name name
|
|||
|
and Venus will search the directory identified by cfs_lookup_in.VFid.
|
|||
|
The result may indicate that the name does not exist, or that
|
|||
|
difficulty was encountered in finding it (e.g. due to disconnection).
|
|||
|
If the result is zero, the field cfs_lookup_out.VFid contains the
|
|||
|
targets ViceFid and cfs_lookup_out.vtype the coda_vtype giving the
|
|||
|
type of object the name designates.
|
|||
|
|
|||
|
The name of the object is an 8 bit character string of maximum length
|
|||
|
CFS_MAXNAMLEN, currently set to 256 (including a 0 terminator.)
|
|||
|
|
|||
|
It is extremely important to realize that Venus bitwise ors the field
|
|||
|
cfs_lookup.vtype with CFS_NOCACHE to indicate that the object should
|
|||
|
not be put in the kernel name cache.
|
|||
|
|
|||
|
NNOOTTEE The type of the vtype is currently wrong. It should be
|
|||
|
coda_vtype. Linux does not take note of CFS_NOCACHE. It should.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..55.. ggeettaattttrr
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Get the attributes of a file.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_getattr_in {
|
|||
|
ViceFid VFid;
|
|||
|
struct coda_vattr attr; /* XXXXX */
|
|||
|
} cfs_getattr;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_getattr_out {
|
|||
|
struct coda_vattr attr;
|
|||
|
} cfs_getattr;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This call returns the attributes of the file identified by
|
|||
|
fid.
|
|||
|
|
|||
|
EErrrroorrss Errors can occur if the object with fid does not exist, is
|
|||
|
unaccessible or if the caller does not have permission to fetch
|
|||
|
attributes.
|
|||
|
|
|||
|
NNoottee Many kernel FS drivers (Linux, NT and Windows 95) need to acquire
|
|||
|
the attributes as well as the Fid for the instantiation of an internal
|
|||
|
"inode" or "FileHandle". A significant improvement in performance on
|
|||
|
such systems could be made by combining the _l_o_o_k_u_p and _g_e_t_a_t_t_r calls
|
|||
|
both at the Venus/kernel interaction level and at the RPC level.
|
|||
|
|
|||
|
The vattr structure included in the input arguments is superfluous and
|
|||
|
should be removed.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..66.. sseettaattttrr
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Set the attributes of a file.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_setattr_in {
|
|||
|
ViceFid VFid;
|
|||
|
struct coda_vattr attr;
|
|||
|
} cfs_setattr;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
empty
|
|||
|
|
|||
|
DDeessccrriippttiioonn The structure attr is filled with attributes to be changed
|
|||
|
in BSD style. Attributes not to be changed are set to -1, apart from
|
|||
|
vtype which is set to VNON. Other are set to the value to be assigned.
|
|||
|
The only attributes which the FS driver may request to change are the
|
|||
|
mode, owner, groupid, atime, mtime and ctime. The return value
|
|||
|
indicates success or failure.
|
|||
|
|
|||
|
EErrrroorrss A variety of errors can occur. The object may not exist, may
|
|||
|
be inaccessible, or permission may not be granted by Venus.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..77.. aacccceessss
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_access_in {
|
|||
|
ViceFid VFid;
|
|||
|
int flags;
|
|||
|
} cfs_access;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
empty
|
|||
|
|
|||
|
DDeessccrriippttiioonn Verify if access to the object identified by VFid for
|
|||
|
operations described by flags is permitted. The result indicates if
|
|||
|
access will be granted. It is important to remember that Coda uses
|
|||
|
ACLs to enforce protection and that ultimately the servers, not the
|
|||
|
clients enforce the security of the system. The result of this call
|
|||
|
will depend on whether a _t_o_k_e_n is held by the user.
|
|||
|
|
|||
|
EErrrroorrss The object may not exist, or the ACL describing the protection
|
|||
|
may not be accessible.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..88.. ccrreeaattee
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Invoked to create a file
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_create_in {
|
|||
|
ViceFid VFid;
|
|||
|
struct coda_vattr attr;
|
|||
|
int excl;
|
|||
|
int mode;
|
|||
|
char *name; /* Place holder for data. */
|
|||
|
} cfs_create;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_create_out {
|
|||
|
ViceFid VFid;
|
|||
|
struct coda_vattr attr;
|
|||
|
} cfs_create;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This upcall is invoked to request creation of a file.
|
|||
|
The file will be created in the directory identified by VFid, its name
|
|||
|
will be name, and the mode will be mode. If excl is set an error will
|
|||
|
be returned if the file already exists. If the size field in attr is
|
|||
|
set to zero the file will be truncated. The uid and gid of the file
|
|||
|
are set by converting the CodaCred to a uid using a macro CRTOUID
|
|||
|
(this macro is platform dependent). Upon success the VFid and
|
|||
|
attributes of the file are returned. The Coda FS Driver will normally
|
|||
|
instantiate a vnode, inode or file handle at kernel level for the new
|
|||
|
object.
|
|||
|
|
|||
|
|
|||
|
EErrrroorrss A variety of errors can occur. Permissions may be insufficient.
|
|||
|
If the object exists and is not a file the error EISDIR is returned
|
|||
|
under Unix.
|
|||
|
|
|||
|
NNOOTTEE The packing of parameters is very inefficient and appears to
|
|||
|
indicate confusion between the system call creat and the VFS operation
|
|||
|
create. The VFS operation create is only called to create new objects.
|
|||
|
This create call differs from the Unix one in that it is not invoked
|
|||
|
to return a file descriptor. The truncate and exclusive options,
|
|||
|
together with the mode, could simply be part of the mode as it is
|
|||
|
under Unix. There should be no flags argument; this is used in open
|
|||
|
(2) to return a file descriptor for READ or WRITE mode.
|
|||
|
|
|||
|
The attributes of the directory should be returned too, since the size
|
|||
|
and mtime changed.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..99.. mmkkddiirr
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Create a new directory.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_mkdir_in {
|
|||
|
ViceFid VFid;
|
|||
|
struct coda_vattr attr;
|
|||
|
char *name; /* Place holder for data. */
|
|||
|
} cfs_mkdir;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_mkdir_out {
|
|||
|
ViceFid VFid;
|
|||
|
struct coda_vattr attr;
|
|||
|
} cfs_mkdir;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This call is similar to create but creates a directory.
|
|||
|
Only the mode field in the input parameters is used for creation.
|
|||
|
Upon successful creation, the attr returned contains the attributes of
|
|||
|
the new directory.
|
|||
|
|
|||
|
EErrrroorrss As for create.
|
|||
|
|
|||
|
NNOOTTEE The input parameter should be changed to mode instead of
|
|||
|
attributes.
|
|||
|
|
|||
|
The attributes of the parent should be returned since the size and
|
|||
|
mtime changes.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1100.. lliinnkk
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Create a link to an existing file.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_link_in {
|
|||
|
ViceFid sourceFid; /* cnode to link *to* */
|
|||
|
ViceFid destFid; /* Directory in which to place link */
|
|||
|
char *tname; /* Place holder for data. */
|
|||
|
} cfs_link;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
empty
|
|||
|
|
|||
|
DDeessccrriippttiioonn This call creates a link to the sourceFid in the directory
|
|||
|
identified by destFid with name tname. The source must reside in the
|
|||
|
target's parent, i.e. the source must be have parent destFid, i.e. Coda
|
|||
|
does not support cross directory hard links. Only the return value is
|
|||
|
relevant. It indicates success or the type of failure.
|
|||
|
|
|||
|
EErrrroorrss The usual errors can occur.0wpage
|
|||
|
|
|||
|
44..1111.. ssyymmlliinnkk
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy create a symbolic link
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_symlink_in {
|
|||
|
ViceFid VFid; /* Directory to put symlink in */
|
|||
|
char *srcname;
|
|||
|
struct coda_vattr attr;
|
|||
|
char *tname;
|
|||
|
} cfs_symlink;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
none
|
|||
|
|
|||
|
DDeessccrriippttiioonn Create a symbolic link. The link is to be placed in the
|
|||
|
directory identified by VFid and named tname. It should point to the
|
|||
|
pathname srcname. The attributes of the newly created object are to
|
|||
|
be set to attr.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE The attributes of the target directory should be returned since
|
|||
|
its size changed.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1122.. rreemmoovvee
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Remove a file
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_remove_in {
|
|||
|
ViceFid VFid;
|
|||
|
char *name; /* Place holder for data. */
|
|||
|
} cfs_remove;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
none
|
|||
|
|
|||
|
DDeessccrriippttiioonn Remove file named cfs_remove_in.name in directory
|
|||
|
identified by VFid.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE The attributes of the directory should be returned since its
|
|||
|
mtime and size may change.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1133.. rrmmddiirr
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Remove a directory
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_rmdir_in {
|
|||
|
ViceFid VFid;
|
|||
|
char *name; /* Place holder for data. */
|
|||
|
} cfs_rmdir;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
none
|
|||
|
|
|||
|
DDeessccrriippttiioonn Remove the directory with name name from the directory
|
|||
|
identified by VFid.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE The attributes of the parent directory should be returned since
|
|||
|
its mtime and size may change.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1144.. rreeaaddlliinnkk
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Read the value of a symbolic link.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_readlink_in {
|
|||
|
ViceFid VFid;
|
|||
|
} cfs_readlink;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_readlink_out {
|
|||
|
int count;
|
|||
|
caddr_t data; /* Place holder for data. */
|
|||
|
} cfs_readlink;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This routine reads the contents of symbolic link
|
|||
|
identified by VFid into the buffer data. The buffer data must be able
|
|||
|
to hold any name up to CFS_MAXNAMLEN (PATH or NAM??).
|
|||
|
|
|||
|
EErrrroorrss No unusual errors.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1155.. ooppeenn
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Open a file.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_open_in {
|
|||
|
ViceFid VFid;
|
|||
|
int flags;
|
|||
|
} cfs_open;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_open_out {
|
|||
|
dev_t dev;
|
|||
|
ino_t inode;
|
|||
|
} cfs_open;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This request asks Venus to place the file identified by
|
|||
|
VFid in its cache and to note that the calling process wishes to open
|
|||
|
it with flags as in open(2). The return value to the kernel differs
|
|||
|
for Unix and Windows systems. For Unix systems the Coda FS Driver is
|
|||
|
informed of the device and inode number of the container file in the
|
|||
|
fields dev and inode. For Windows the path of the container file is
|
|||
|
returned to the kernel.
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE Currently the cfs_open_out structure is not properly adapted to
|
|||
|
deal with the Windows case. It might be best to implement two
|
|||
|
upcalls, one to open aiming at a container file name, the other at a
|
|||
|
container file inode.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1166.. cclloossee
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Close a file, update it on the servers.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_close_in {
|
|||
|
ViceFid VFid;
|
|||
|
int flags;
|
|||
|
} cfs_close;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
none
|
|||
|
|
|||
|
DDeessccrriippttiioonn Close the file identified by VFid.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE The flags argument is bogus and not used. However, Venus' code
|
|||
|
has room to deal with an execp input field, probably this field should
|
|||
|
be used to inform Venus that the file was closed but is still memory
|
|||
|
mapped for execution. There are comments about fetching versus not
|
|||
|
fetching the data in Venus vproc_vfscalls. This seems silly. If a
|
|||
|
file is being closed, the data in the container file is to be the new
|
|||
|
data. Here again the execp flag might be in play to create confusion:
|
|||
|
currently Venus might think a file can be flushed from the cache when
|
|||
|
it is still memory mapped. This needs to be understood.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1177.. iiooccttll
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Do an ioctl on a file. This includes the pioctl interface.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_ioctl_in {
|
|||
|
ViceFid VFid;
|
|||
|
int cmd;
|
|||
|
int len;
|
|||
|
int rwflag;
|
|||
|
char *data; /* Place holder for data. */
|
|||
|
} cfs_ioctl;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
|
|||
|
struct cfs_ioctl_out {
|
|||
|
int len;
|
|||
|
caddr_t data; /* Place holder for data. */
|
|||
|
} cfs_ioctl;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Do an ioctl operation on a file. The command, len and
|
|||
|
data arguments are filled as usual. flags is not used by Venus.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE Another bogus parameter. flags is not used. What is the
|
|||
|
business about PREFETCHING in the Venus code?
|
|||
|
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1188.. rreennaammee
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Rename a fid.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_rename_in {
|
|||
|
ViceFid sourceFid;
|
|||
|
char *srcname;
|
|||
|
ViceFid destFid;
|
|||
|
char *destname;
|
|||
|
} cfs_rename;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
none
|
|||
|
|
|||
|
DDeessccrriippttiioonn Rename the object with name srcname in directory
|
|||
|
sourceFid to destname in destFid. It is important that the names
|
|||
|
srcname and destname are 0 terminated strings. Strings in Unix
|
|||
|
kernels are not always null terminated.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..1199.. rreeaaddddiirr
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Read directory entries.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_readdir_in {
|
|||
|
ViceFid VFid;
|
|||
|
int count;
|
|||
|
int offset;
|
|||
|
} cfs_readdir;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_readdir_out {
|
|||
|
int size;
|
|||
|
caddr_t data; /* Place holder for data. */
|
|||
|
} cfs_readdir;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Read directory entries from VFid starting at offset and
|
|||
|
read at most count bytes. Returns the data in data and returns
|
|||
|
the size in size.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE This call is not used. Readdir operations exploit container
|
|||
|
files. We will re-evaluate this during the directory revamp which is
|
|||
|
about to take place.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2200.. vvggeett
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy instructs Venus to do an FSDB->Get.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_vget_in {
|
|||
|
ViceFid VFid;
|
|||
|
} cfs_vget;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_vget_out {
|
|||
|
ViceFid VFid;
|
|||
|
int vtype;
|
|||
|
} cfs_vget;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This upcall asks Venus to do a get operation on an fsobj
|
|||
|
labelled by VFid.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE This operation is not used. However, it is extremely useful
|
|||
|
since it can be used to deal with read/write memory mapped files.
|
|||
|
These can be "pinned" in the Venus cache using vget and released with
|
|||
|
inactive.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2211.. ffssyynncc
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Tell Venus to update the RVM attributes of a file.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_fsync_in {
|
|||
|
ViceFid VFid;
|
|||
|
} cfs_fsync;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
none
|
|||
|
|
|||
|
DDeessccrriippttiioonn Ask Venus to update RVM attributes of object VFid. This
|
|||
|
should be called as part of kernel level fsync type calls. The
|
|||
|
result indicates if the syncing was successful.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE Linux does not implement this call. It should.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2222.. iinnaaccttiivvee
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Tell Venus a vnode is no longer in use.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_inactive_in {
|
|||
|
ViceFid VFid;
|
|||
|
} cfs_inactive;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
none
|
|||
|
|
|||
|
DDeessccrriippttiioonn This operation returns EOPNOTSUPP.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE This should perhaps be removed.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2233.. rrddwwrr
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Read or write from a file
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct cfs_rdwr_in {
|
|||
|
ViceFid VFid;
|
|||
|
int rwflag;
|
|||
|
int count;
|
|||
|
int offset;
|
|||
|
int ioflag;
|
|||
|
caddr_t data; /* Place holder for data. */
|
|||
|
} cfs_rdwr;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct cfs_rdwr_out {
|
|||
|
int rwflag;
|
|||
|
int count;
|
|||
|
caddr_t data; /* Place holder for data. */
|
|||
|
} cfs_rdwr;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This upcall asks Venus to read or write from a file.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE It should be removed since it is against the Coda philosophy that
|
|||
|
read/write operations never reach Venus. I have been told the
|
|||
|
operation does not work. It is not currently used.
|
|||
|
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2244.. ooddyymmoouunntt
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Allows mounting multiple Coda "filesystems" on one Unix mount
|
|||
|
point.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn
|
|||
|
|
|||
|
struct ody_mount_in {
|
|||
|
char *name; /* Place holder for data. */
|
|||
|
} ody_mount;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
|
|||
|
struct ody_mount_out {
|
|||
|
ViceFid VFid;
|
|||
|
} ody_mount;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Asks Venus to return the rootfid of a Coda system named
|
|||
|
name. The fid is returned in VFid.
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE This call was used by David for dynamic sets. It should be
|
|||
|
removed since it causes a jungle of pointers in the VFS mounting area.
|
|||
|
It is not used by Coda proper. Call is not implemented by Venus.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2255.. ooddyy__llooookkuupp
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Looks up something.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn irrelevant
|
|||
|
|
|||
|
|
|||
|
oouutt
|
|||
|
irrelevant
|
|||
|
|
|||
|
DDeessccrriippttiioonn
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE Gut it. Call is not implemented by Venus.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2266.. ooddyy__eexxppaanndd
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy expands something in a dynamic set.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn irrelevant
|
|||
|
|
|||
|
oouutt
|
|||
|
irrelevant
|
|||
|
|
|||
|
DDeessccrriippttiioonn
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE Gut it. Call is not implemented by Venus.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2277.. pprreeffeettcchh
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Prefetch a dynamic set.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn Not documented.
|
|||
|
|
|||
|
oouutt
|
|||
|
Not documented.
|
|||
|
|
|||
|
DDeessccrriippttiioonn Venus worker.cc has support for this call, although it is
|
|||
|
noted that it doesn't work. Not surprising, since the kernel does not
|
|||
|
have support for it. (ODY_PREFETCH is not a defined operation).
|
|||
|
|
|||
|
EErrrroorrss
|
|||
|
|
|||
|
NNOOTTEE Gut it. It isn't working and isn't used by Coda.
|
|||
|
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
44..2288.. ssiiggnnaall
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Send Venus a signal about an upcall.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
iinn none
|
|||
|
|
|||
|
oouutt
|
|||
|
not applicable.
|
|||
|
|
|||
|
DDeessccrriippttiioonn This is an out-of-band upcall to Venus to inform Venus
|
|||
|
that the calling process received a signal after Venus read the
|
|||
|
message from the input queue. Venus is supposed to clean up the
|
|||
|
operation.
|
|||
|
|
|||
|
EErrrroorrss No reply is given.
|
|||
|
|
|||
|
NNOOTTEE We need to better understand what Venus needs to clean up and if
|
|||
|
it is doing this correctly. Also we need to handle multiple upcall
|
|||
|
per system call situations correctly. It would be important to know
|
|||
|
what state changes in Venus take place after an upcall for which the
|
|||
|
kernel is responsible for notifying Venus to clean up (e.g. open
|
|||
|
definitely is such a state change, but many others are maybe not).
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
55.. TThhee mmiinniiccaacchhee aanndd ddoowwnnccaallllss
|
|||
|
|
|||
|
|
|||
|
The Coda FS Driver can cache results of lookup and access upcalls, to
|
|||
|
limit the frequency of upcalls. Upcalls carry a price since a process
|
|||
|
context switch needs to take place. The counterpart of caching the
|
|||
|
information is that Venus will notify the FS Driver that cached
|
|||
|
entries must be flushed or renamed.
|
|||
|
|
|||
|
The kernel code generally has to maintain a structure which links the
|
|||
|
internal file handles (called vnodes in BSD, inodes in Linux and
|
|||
|
FileHandles in Windows) with the ViceFid's which Venus maintains. The
|
|||
|
reason is that frequent translations back and forth are needed in
|
|||
|
order to make upcalls and use the results of upcalls. Such linking
|
|||
|
objects are called ccnnooddeess.
|
|||
|
|
|||
|
The current minicache implementations have cache entries which record
|
|||
|
the following:
|
|||
|
|
|||
|
1. the name of the file
|
|||
|
|
|||
|
2. the cnode of the directory containing the object
|
|||
|
|
|||
|
3. a list of CodaCred's for which the lookup is permitted.
|
|||
|
|
|||
|
4. the cnode of the object
|
|||
|
|
|||
|
The lookup call in the Coda FS Driver may request the cnode of the
|
|||
|
desired object from the cache, by passing its name, directory and the
|
|||
|
CodaCred's of the caller. The cache will return the cnode or indicate
|
|||
|
that it cannot be found. The Coda FS Driver must be careful to
|
|||
|
invalidate cache entries when it modifies or removes objects.
|
|||
|
|
|||
|
When Venus obtains information that indicates that cache entries are
|
|||
|
no longer valid, it will make a downcall to the kernel. Downcalls are
|
|||
|
intercepted by the Coda FS Driver and lead to cache invalidations of
|
|||
|
the kind described below. The Coda FS Driver does not return an error
|
|||
|
unless the downcall data could not be read into kernel memory.
|
|||
|
|
|||
|
|
|||
|
55..11.. IINNVVAALLIIDDAATTEE
|
|||
|
|
|||
|
|
|||
|
No information is available on this call.
|
|||
|
|
|||
|
|
|||
|
55..22.. FFLLUUSSHH
|
|||
|
|
|||
|
|
|||
|
|
|||
|
AArrgguummeennttss None
|
|||
|
|
|||
|
SSuummmmaarryy Flush the name cache entirely.
|
|||
|
|
|||
|
DDeessccrriippttiioonn Venus issues this call upon startup and when it dies. This
|
|||
|
is to prevent stale cache information being held. Some operating
|
|||
|
systems allow the kernel name cache to be switched off dynamically.
|
|||
|
When this is done, this downcall is made.
|
|||
|
|
|||
|
|
|||
|
55..33.. PPUURRGGEEUUSSEERR
|
|||
|
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
struct cfs_purgeuser_out {/* CFS_PURGEUSER is a venus->kernel call */
|
|||
|
struct CodaCred cred;
|
|||
|
} cfs_purgeuser;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Remove all entries in the cache carrying the Cred. This
|
|||
|
call is issued when tokens for a user expire or are flushed.
|
|||
|
|
|||
|
|
|||
|
55..44.. ZZAAPPFFIILLEE
|
|||
|
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
struct cfs_zapfile_out { /* CFS_ZAPFILE is a venus->kernel call */
|
|||
|
ViceFid CodaFid;
|
|||
|
} cfs_zapfile;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Remove all entries which have the (dir vnode, name) pair.
|
|||
|
This is issued as a result of an invalidation of cached attributes of
|
|||
|
a vnode.
|
|||
|
|
|||
|
NNOOTTEE Call is not named correctly in NetBSD and Mach. The minicache
|
|||
|
zapfile routine takes different arguments. Linux does not implement
|
|||
|
the invalidation of attributes correctly.
|
|||
|
|
|||
|
|
|||
|
|
|||
|
55..55.. ZZAAPPDDIIRR
|
|||
|
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
struct cfs_zapdir_out { /* CFS_ZAPDIR is a venus->kernel call */
|
|||
|
ViceFid CodaFid;
|
|||
|
} cfs_zapdir;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Remove all entries in the cache lying in a directory
|
|||
|
CodaFid, and all children of this directory. This call is issued when
|
|||
|
Venus receives a callback on the directory.
|
|||
|
|
|||
|
|
|||
|
55..66.. ZZAAPPVVNNOODDEE
|
|||
|
|
|||
|
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
struct cfs_zapvnode_out { /* CFS_ZAPVNODE is a venus->kernel call */
|
|||
|
struct CodaCred cred;
|
|||
|
ViceFid VFid;
|
|||
|
} cfs_zapvnode;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Remove all entries in the cache carrying the cred and VFid
|
|||
|
as in the arguments. This downcall is probably never issued.
|
|||
|
|
|||
|
|
|||
|
55..77.. PPUURRGGEEFFIIDD
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
struct cfs_purgefid_out { /* CFS_PURGEFID is a venus->kernel call */
|
|||
|
ViceFid CodaFid;
|
|||
|
} cfs_purgefid;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn Flush the attribute for the file. If it is a dir (odd
|
|||
|
vnode), purge its children from the namecache and remove the file from the
|
|||
|
namecache.
|
|||
|
|
|||
|
|
|||
|
|
|||
|
55..88.. RREEPPLLAACCEE
|
|||
|
|
|||
|
|
|||
|
SSuummmmaarryy Replace the Fid's for a collection of names.
|
|||
|
|
|||
|
AArrgguummeennttss
|
|||
|
|
|||
|
struct cfs_replace_out { /* cfs_replace is a venus->kernel call */
|
|||
|
ViceFid NewFid;
|
|||
|
ViceFid OldFid;
|
|||
|
} cfs_replace;
|
|||
|
|
|||
|
|
|||
|
|
|||
|
DDeessccrriippttiioonn This routine replaces a ViceFid in the name cache with
|
|||
|
another. It is added to allow Venus during reintegration to replace
|
|||
|
locally allocated temp fids while disconnected with global fids even
|
|||
|
when the reference counts on those fids are not zero.
|
|||
|
|
|||
|
0wpage
|
|||
|
|
|||
|
66.. IInniittiiaalliizzaattiioonn aanndd cclleeaannuupp
|
|||
|
|
|||
|
|
|||
|
This section gives brief hints as to desirable features for the Coda
|
|||
|
FS Driver at startup and upon shutdown or Venus failures. Before
|
|||
|
entering the discussion it is useful to repeat that the Coda FS Driver
|
|||
|
maintains the following data:
|
|||
|
|
|||
|
|
|||
|
1. message queues
|
|||
|
|
|||
|
2. cnodes
|
|||
|
|
|||
|
3. name cache entries
|
|||
|
|
|||
|
The name cache entries are entirely private to the driver, so they
|
|||
|
can easily be manipulated. The message queues will generally have
|
|||
|
clear points of initialization and destruction. The cnodes are
|
|||
|
much more delicate. User processes hold reference counts in Coda
|
|||
|
filesystems and it can be difficult to clean up the cnodes.
|
|||
|
|
|||
|
It can expect requests through:
|
|||
|
|
|||
|
1. the message subsystem
|
|||
|
|
|||
|
2. the VFS layer
|
|||
|
|
|||
|
3. pioctl interface
|
|||
|
|
|||
|
Currently the _p_i_o_c_t_l passes through the VFS for Coda so we can
|
|||
|
treat these similarly.
|
|||
|
|
|||
|
|
|||
|
66..11.. RReeqquuiirreemmeennttss
|
|||
|
|
|||
|
|
|||
|
The following requirements should be accommodated:
|
|||
|
|
|||
|
1. The message queues should have open and close routines. On Unix
|
|||
|
the opening of the character devices are such routines.
|
|||
|
|
|||
|
+o Before opening, no messages can be placed.
|
|||
|
|
|||
|
+o Opening will remove any old messages still pending.
|
|||
|
|
|||
|
+o Close will notify any sleeping processes that their upcall cannot
|
|||
|
be completed.
|
|||
|
|
|||
|
+o Close will free all memory allocated by the message queues.
|
|||
|
|
|||
|
|
|||
|
2. At open the namecache shall be initialized to empty state.
|
|||
|
|
|||
|
3. Before the message queues are open, all VFS operations will fail.
|
|||
|
Fortunately this can be achieved by making sure than mounting the
|
|||
|
Coda filesystem cannot succeed before opening.
|
|||
|
|
|||
|
4. After closing of the queues, no VFS operations can succeed. Here
|
|||
|
one needs to be careful, since a few operations (lookup,
|
|||
|
read/write, readdir) can proceed without upcalls. These must be
|
|||
|
explicitly blocked.
|
|||
|
|
|||
|
5. Upon closing the namecache shall be flushed and disabled.
|
|||
|
|
|||
|
6. All memory held by cnodes can be freed without relying on upcalls.
|
|||
|
|
|||
|
7. Unmounting the file system can be done without relying on upcalls.
|
|||
|
|
|||
|
8. Mounting the Coda filesystem should fail gracefully if Venus cannot
|
|||
|
get the rootfid or the attributes of the rootfid. The latter is
|
|||
|
best implemented by Venus fetching these objects before attempting
|
|||
|
to mount.
|
|||
|
|
|||
|
NNOOTTEE NetBSD in particular but also Linux have not implemented the
|
|||
|
above requirements fully. For smooth operation this needs to be
|
|||
|
corrected.
|
|||
|
|
|||
|
|
|||
|
|