Chapter 1_Network Programming
Chapter 1_Network Programming
Learning Objectives
After reading this chapter, you should:
• have a high level appreciation of the basic means by which messages are sent and
received on modern networks;
• be familiar with the most important protocols used on networks;
• understand the addressing mechanism used on the Internet;
• understand the basic principles of client/server programming.
The fundamental purpose of this opening chapter is to introduce the underpinning
network principles and associated terminology with which the reader will need to be
familiar in order to make sense of the later chapters of this book. The material cov-
ered here is entirely generic (as far as any programming language is concerned) and
it is not until the next chapter that we shall begin to consider how Java may be used
in network programming. If the meaning of any term covered here is not clear when
that term is later encountered in context, the reader should refer back to this chapter
to refresh his/her memory.
It would be very easy to make this chapter considerably larger than it currently
is, simply by including a great deal of dry, technical material that would be unlikely
to be of any practical use to the intended readers of this book. However, this chapter
is intentionally brief, the author having avoided the inclusion of material that is not
of relevance to the use of Java for network programming. The reader who already
has a sound grasp of network concepts may safely skip this chapter entirely.
The most common categories of network software nowadays are clients and servers.
These two categories have a symbiotic relationship and the term client/server
programming has become very widely used in recent years. It is important to
distinguish firstly between a server and the machine upon which the server is running
(called the host machine), since I.T. workers often refer loosely to the host machine
as ‘the server’. Though this common usage has no detrimental practical effects for
the majority of I.T. tasks, those I.T. personnel who are unaware of the distinction
and subsequently undertake network programming are likely to be caused a signifi-
cant amount of conceptual confusion until this distinction is made known to them.
A server, as the name implies, provides a service of some kind. This service is
provided for clients that connect to the server’s host machine specifically for the
purpose of accessing the service. Thus, it is the clients that initiate a dialogue with
the server. (These clients, of course, are also programs and are not human clients!)
Common services provided by such servers include the ‘serving up’ of Web pages
(by Web servers) and the downloading of files from servers’ host machines via the
File Transfer Protocol (FTP servers). For the former service, the corresponding
client programs would be Web browsers (such as Firefox, Chrome or Internet
Explorer). Though a client and its corresponding server will normally run on dif-
ferent machines in a real-world application, it is perfectly possible for such pro-
grams to run on the same machine. Indeed, it is often very convenient (as will be
seen in subsequent chapters) for server and client(s) to be run on the same machine,
since this provides a very convenient ‘sandbox’ within which such applications
may be tested before being released (or, more likely, before final testing on sepa-
rate machines). This avoids the need for multiple machines and multiple testing
personnel.
In some applications, such as messaging services, it is possible for programs on
users’ machines to communicate directly with each other in what is called peer-to-
peer (or P2P) mode. However, for many applications, this is either not possible or
prohibitively costly in terms of the number of simultaneous connections required.
For example, the World Wide Web simply does not allow clients to communicate
directly with each other. However, some applications use a server as an intermedi-
ary, in order to provide ‘simulated’peer-to-peer facilities. Alternatively, both ends of
the dialogue may act as both client and server. Peer-to-peer systems are beyond the
intended scope of this text, though, and no further mention will be made of them.
These entities lie at the heart of network communications. For anybody not already
familiar with the use of these terms in a network programming context, the two
words very probably conjure up images of hardware components. However,
although they are closely associated with the hardware communication links
between computers within a network, ports and sockets are not themselves hard-
ware elements, but abstract concepts that allow the programmer to make use of
those communication links.
A port is a logical connection to a computer (as opposed to a physical connec-
tion) and is identified by a number in the range 1–65535. This number has no
1.3 The Internet and IP Addresses 3
Whatever the service provided by a server, there must be some established proto-
col governing the communication that takes place between server and client. Each
end of the dialogue must know what may/must be sent to the other, the format in
which it should be sent, the sequence in which it must be sent (if sequence matters)
and, for ‘open-ended’ dialogues, how the dialogue is to be terminated. For the
standard services, such protocols are made available in public documents, usually
by either the Internet Engineering Task Force (IETF) or the World Wide Web
Consortium (W3C). Some of the more common services and their associated ports
are shown in Table 1.1. For a more esoteric or ‘bespoke’ service, the application
writer must establish a protocol and convey it to the intended users of that service.
A URL (Uniform Resource Locator) is a unique identifier for any resource located
on the Internet. It has the following structure (in which BNF notation is used):
<protocol>://<hostname>[:<port>][/<pathname>]
[/<filename>[#<section>]]
For example:
http://www.oracle.com/technetwork/java/javase/down-
loads/index.html
For a well-known protocol, the port number may be omitted and the default port
number will be assumed. Thus, since the example above specifies the HTTP proto-
col (the protocol of the Web) and does not specify on which port of the host machine
the service is available, it will be assumed that the service is running on port 80 (the
default port for Web servers). If the file name is omitted, then the server sends a
default file from the directory specified in the path name. (This default file will com-
monly be called index.html or default.html.) The ‘section’ part of the URL (not often
specified) indicates a named ‘anchor’ in an HTML document. For example, the
HTML anchor in the tag
<A HREF="#summary">Summary of Report</A>
would be referred to as summary by the section component of the URL.
Since human beings are generally much better at remembering meaningful
strings of characters than they are at remembering long strings of numbers, the
Domain Name System was developed. A domain name, also known as a host name,
is the user-friendly equivalent of an IP address. In the previous example of a URL,
the domain name was www.oracle.com. The individual parts of a domain name
don’t correspond to the individual parts of an IP address. In fact, domain names
don’t always have four parts (as IPv4 addresses must have).
Normally, human beings will use domain names in preference to IP addresses,
but they can just as well use the corresponding IP addresses (if they know what they
are!). The Domain Name System provides a mapping between IP addresses and
domain names and is held in a distributed database. The IP address system and the
DNS are governed by ICANN (the Internet Corporation for Assigned Names and
Numbers), which is a non-profitmaking organisation. When a URL is submitted to
a browser, the DNS automatically converts the domain name part into its numeric IP
equivalent.
1.5 TCP
packets from the same message. IP is concerned with the routing of these packets
through an internet. Introduced by the American military during the Cold War, it
was designed from the outset to be robust. In the event of a military strike against
one of the network routers, the rest of the network had to continue to function as
normal, with messages that would have gone through the damaged router being re-
routed. IP is responsible for this re-routing. It attaches the IP address of the intended
recipient to each packet and then tries to determine the most efficient route available
to get to the ultimate destination (taking damaged routers into account).
However, since packets could still arrive out of sequence, be corrupted or even not
arrive at all (without indication to either sender or intended recipient that anything
had gone wrong), it was decided to place another protocol layer on top of IP. This
further layer was provided by TCP (Transmission Control Protocol), which allowed
each end of a connection to acknowledge receipt of IP packets and/or request
retransmission of lost or corrupted packets. In addition, TCP allows the packets to
be rearranged into their correct sequence at the receiving end. IP and TCP are the
two commonest protocols used on the Internet and are almost invariably coupled
together as TCP/IP. TCP is the higher level protocol that uses the lower level IP.
For Internet applications, a four-layer model is often used, which is represented
diagrammatically in Fig. 1.1 below. The transport layer will often comprise the TCP
protocol, but may be UDP (described in the next section), while the internet layer
will always be IP. Each layer of the model represents a different level of abstraction,
with higher levels representing higher abstraction. Thus, although applications may
appear to be communicating directly with each other, they are actually communicat-
ing directly only with their transport layers. The transport and internet layers, in
their turn, communicate directly only with the layers immediately above and below
them, while the host-to-network layer communicates directly only with the IP layer
at each end of the connection. When a message is sent by the application layer at
one end of the connection, it passes through each of the lower layers. As it does so,
each layer adds further protocol data specific to the particular protocol at that level.
For the TCP layer, this process involves breaking up the data packets into TCP seg-
ments and adding sequence numbers and checksums; for the IP layer, it involves
placing the TCP segments into IP packets called datagrams and adding the routing
details. The host-to-network layer then converts the digital data into an analogue
form suitable for transmission over the carrier wire, sends the data and converts it
back into digital form at the receiving end.
Logical
At the receiving end, the message travels up through the layers until it reaches
the receiving application layer. As it does so, each layer converts the message into a
form suitable for receipt by the next layer (effectively reversing the corresponding
process carried out at the sending end) and carries out checks appropriate to its own
protocol. If recalculation of checksums reveals that some of the data has been cor-
rupted or checking of sequence numbers shows that some data has not been received,
then the transport layer requests re-transmission of the corrupt/missing data.
Otherwise, the transport layer acknowledges receipt of the packets. All of this is
completely transparent to the application layer. Once all the data has been received,
converted and correctly sequenced, it is presented to the recipient application layer
as though that layer had been in direct communication with the sending application
layer. The latter may then send a response in exactly the same manner (and so on).
In fact, since TCP provides full duplex transmission, the two ends of the connection
may be sending data simultaneously.
The above description has deliberately hidden many of the low-level details of
implementation, particularly the tasks carried out by the host-to-network layer.
In addition, of course, the initial transmission may have passed through several
routers and their associated layers before arriving at its ultimate destination.
However, this high-level view covers the basic stages that are involved and is quite
sufficient for our purposes.
Another network model that is often referred to is the seven-layer Open Systems
Interconnection (OSI) model. However, this model is an unnecessarily complex one
for our purposes and is better suited to non-TCP/IP networks anyway.
1.6 UDP
the sound/video without interruption and losing a few bytes of data was much better
than waiting for re-transmission of the missing data.
Nowadays, network transmission speeds are considerably greater than they were
only a few years ago, meaning that UDP is now a feasible transport mechanism for
applications in which it would not once have been considered. In addition to this, it
is much easier for TCP packets to get through firewalls than it is for UDP packets to
do so, since Web administrators tend to allow TCP packets from remote port 80s to
pass through unchallenged. For these reasons, the choice of whether to use TCP or
UDP for speed-critical applications is not nearly as clear cut as it used to be.