Sophie

Sophie

distrib > Mandriva > 2010.1 > x86_64 > by-pkgid > 965e33040dd61030a94f0eb89877aee8 > files > 3852

howto-html-en-20080722-2mdv2010.1.noarch.rpm

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<HTML>
<HEAD>
 <META NAME="GENERATOR" CONTENT="SGML-Tools 1.0.9">
 <TITLE>HOWTO: Multi Disk System Tuning: Clusters</TITLE>
 <LINK HREF="Multi-Disk-HOWTO-9.html" REL=next>
 <LINK HREF="Multi-Disk-HOWTO-7.html" REL=previous>
 <LINK HREF="Multi-Disk-HOWTO.html#toc8" REL=contents>
</HEAD>
<BODY>
<A HREF="Multi-Disk-HOWTO-9.html">Next</A>
<A HREF="Multi-Disk-HOWTO-7.html">Previous</A>
<A HREF="Multi-Disk-HOWTO.html#toc8">Contents</A>
<HR>
<H2><A NAME="s8">8. Clusters</A></H2>

<P>
<!--
disk!technologies!clusters
-->

In this section I will briefly touch on the ways machines can be connected
together but this is so big a topic it could be a separate HOWTO in its
own right, hint, hint. Also, strictly speaking, this section lies outside
the scope of this HOWTO, so if you feel like getting fame etc. <EM>you</EM> could
contact me and take over this part and turn it into a new document.
<P>These days computers gets outdated at an incredible rate. There is however
no reason why old hardware could not be put to good use with Linux. Using
an old and otherwise outdated computer as a network server can be both
useful in its own right as well as a valuable educational exercise. Such
a local networked cluster of computers can take on many forms but to remain
within the charter of this HOWTO I will limit myself to the disk strategies.
Nevertheless I would hope someone else could take on this topic and turn it
into a document on its own.
<P>This is an exciting area of activity today, and many forms of clustering
is available today, ranging from automatic workload balancing over local
network to more exotic hardware such as Scalable Coherent Interface (SCI)
which gives a tight integration of machines, effectively turning them into
a single machine. Various kinds of clustering has been available for larger
machines for some time and the VAXcluster is perhaps a well known example
of this. Clustering is done usually in order to share resources such as
disk drives, printers and terminals etc, but also processing resources equally
transparently between the computational nodes.
<P>There is no universal definition of clustering, in here it is taken to mean
a network of machines that combine their resources to serve users. Admittedly
this is a rather loose definition but this will change later.
<P>These days also Linux offers some clustering features but for a starter
I will just describe a simple local network. It is a good way of putting old
and otherwise unusable hardware to good use, as long as they can run Linux or
something similar.
<P>One of the best ways of using an old machine is as a network server in which
case the effective speed is more likely to be limited by network bandwidth
rather than pure computational performance. For home use you can move the
following functionality off to an older machine used as a server:
<UL>
<LI>news</LI>
<LI>mail</LI>
<LI>web proxy</LI>
<LI>printer server</LI>
<LI>modem server (PPP, SLIP, FAX, Voice mail)</LI>
</UL>
<P>You can also <CODE>NFS mount</CODE> drives from the server onto your workstation
thereby reducing drive space requirements. Still read the FSSTND to see
what directories should <EM>not</EM> be exported. The best candidates for
exporting to all machines are <CODE>/usr</CODE> and <CODE>/var/spool</CODE>
and possibly <CODE>/usr/local</CODE> but probably not <CODE>/var/spool/lpd</CODE>.
<P>Most of the time even slow disks will deliver sufficient performance. On
the other hand, if you do processing directly on the disks on the server or
have very fast networking, you might want to rethink your strategy and use
faster drives. Searching features on a web server or news database searches
are two examples of this.
<P>Such a network can be an excellent way of learning system administration
and building up your own toaster network, as it often is called. You can
get more information on this in other HOWTOs but there are two important
things you should keep in mind:
<UL>
<LI>Do not pull IP numbers out of thin air. Configure your inside net
using IP numbers reserved for private use, and use your network server
as a router that handles this IP masquerading.</LI>
<LI>Remember that if you additionally configure the router as a firewall
you might not be able to get to your own data from the outside, depending
on the firewall configuration.</LI>
</UL>
<P>The <EM>Nyx</EM> network provides an example of a cluster in the sense defined here.
It consists of the following machines:
<DL>
<DT><B>nyx</B><DD><P>is one of the two user login machines and also provides some
of the networking services.
<DT><B>nox</B><DD><P>(aka nyx10) is the main user login machine and is also the
mail server.
<DT><B>noc</B><DD><P>is a dedicated news server. The news spool is made accessible
through NFS mounting to nyx and nox.
<DT><B>arachne</B><DD><P>(aka www) is the web server. Web pages are written by
NFS mounting onto nox.
</DL>
<P>There are also some more advanced clustering projects going, notably
<UL>
<LI>
<A HREF="http://www.beowulf.org/">The Beowulf Project</A>
</LI>
<LI>
<A HREF="http://www.disi.unige.it/project/gamma/">The Genoa Active Message Machine (GAMMA)</A>
</LI>
</UL>
<P>
<P>High-tech clustering requires high-tech interconnect, and SCI is one of them.
To find out more you can either look up the home page of
<A HREF="http://www.dolphinics.no/">Dolphin Interconnect Solutions</A>
which is one of the main actors in this field,
or you can have a look at
<A HREF="http://www.scizzl.com/">scizzl</A>.
<P>
<P>Centralised mail servers using IMAP are becoming more and more popular
as disks become large enough to keep all mail stored indefinitely
and also cheap enough to make it a feasible option.
Unfortunately it has become clear that <CODE>NFS</CODE> mounting the mail
archives from another machine can cause corruption of the IMAP
database as the server software does not handle NFS timeouts too well,
and NFS timeouts are a rather common occurrence.
Keep therefore the mail archive local to the IMAP server.
<P>
<P>
<P>
<P>
<HR>
<A HREF="Multi-Disk-HOWTO-9.html">Next</A>
<A HREF="Multi-Disk-HOWTO-7.html">Previous</A>
<A HREF="Multi-Disk-HOWTO.html#toc8">Contents</A>
</BODY>
</HTML>