Pro Node.js for Developers
- Install, configure and deploy Node.js apps effectively
- Understand the Node.js asynchronous programming model in detail
- Create both web and network-based Node.js applications with ease
- Learn to work effectively with varied data sources and file types
- Discover advanced software engineering concepts that will save you time
For your convenience Apress has placed some of the front
matter material after the index. Please use the Bookmarks
and Contents at a Glance links to access them.
Contents at a Glance
About the Author
Acknowledgments
■ Chapter 1: Getting Started
■ Chapter 2: The Node Module System
■ Chapter 3: The Node Programming Model
■ Chapter 4: Events and Timers
■ Chapter 5: The Command Line Interface
■ Chapter 6: The File System
■ Chapter 7: Streams
■ Chapter 8: Binary Data
■ Chapter 9: Executing Code
■ Chapter 10: Network Programming
■ Chapter 11: HTTP
■ Chapter 12: The Express Framework
■ Chapter 13: The Real-Time Web
■ Chapter 14: Databases
■ Chapter 15: Logging, Debugging, and Testing
■ Chapter 16: Application Scaling
■ Appendix A: JavaScript Object Notation
Index
Introduction
Since its creation in 2009, Node.js has grown into a powerful and increasingly popular asynchronous development framework, used for creating highly scalable JavaScript applications. Respected companies such as Dow Jones, LinkedIn, and Walmart are among the many organizations to have seen Node’s potential and adopted it into their businesses.
Pro Node.js for Developers provides a comprehensive guide to this exciting young technology. You will be introduced to Node at a high level before diving deeply into the key concepts and APIs that underpin its operation.
Building upon your existing JavaScript skills, you’ll be shown how to use Node.js to build both web- and network-based applications, to deal with various data sources, capture and generate events, spawn and control child processes, and much more.
Once you’ve mastered these skills, you’ll learn more advanced software engineering skills that will give your code a professional edge. You’ll learn how to create easily reusable code modules, debug and test your applications quickly and effectively, and scale your code from a single thread to the cloud as demand for your application increases.
CHAPTER 1
Getting Started
JavaScript was initially named Mocha when it was developed at Netscape in 1995 by Brendan Eich. In September 1995, beta releases of Netscape Navigator 2.0 were shipped with Mocha, which had been renamed LiveScript. By December 1995 LiveScript, after another renaming, had become JavaScript, the current name. Around that time Netscape was working closely with Sun, the company responsible for creating the Java programming language. The choice of the name JavaScript caused a lot of speculation. Many people thought that Netscape was trying to piggyback on the hot name Java, a buzzword at the time. Unfortunately, the naming choice caused a lot of confusion, as many automatically assumed that the two languages were related somehow. In reality they have very little in common.
Despite the confusion, JavaScript became a very successful client-side scripting language. In response to JavaScript’s success, Microsoft created its own implementation, named JScript, and released it with Internet Explorer 3.0 in August 1996. In November 1996 Netscape submitted JavaScript for standardization to Ecma International, an international standards organization. In June 1997 JavaScript became the standard ECMA-262.
Over the years, JavaScript has remained the de facto standard for client-side development. However, the server space was a completely different story. For the most part, the server realm has belonged to languages such as PHP and Java. A number of projects have implemented JavaScript as a server language, but none of them were particularly successful. Two major hurdles blocked JavaScript’s widespread adoption on the server. The first was its reputation. JavaScript has long been viewed as a toy language, suitable only for amateurs. The second hurdle was JavaScript’s poor performance compared with that of some other languages.
However, JavaScript had one big thing going for it. The Web was undergoing unprecedented growth, and the browser wars were raging. As the only language supported by every major browser, JavaScript engines began receiving attention from Google, Apple, and other companies. All of that attention led to huge improvements in JavaScript performance. Suddenly JavaScript wasn’t lagging anymore.
The development community took note of JavaScript’s newfound power and began creating interesting applications. In 2009 Ryan Dahl created Node.js, a framework primarily used to create highly scalable servers for web applications. Node.js, or simply Node, is written in C++ and JavaScript. To drive Node, Dahl tapped into the power of Google’s V8 JavaScript engine (V8 is the engine inside Google Chrome, the most popular browser in existence). Using V8, developers can write full-blown applications in JavaScript - applications that would normally be written in a language like C or Java. Thus, with the invention of Node, JavaScript finally became a bona fide server-side language.
The Node Execution Model
In addition to speed, Node brought an unconventional execution model to the table. To understand how Node is different, we should compare it with Apache, the popular web server in the Linux, Apache, MySQL, and PHP (LAMP) software stack. First, Apache processes only HTTP requests, leaving application logic to be implemented in a language such as PHP or Java. Node removes a layer of complexity by combining server and application logic in one place. Some developers have criticized this model for eliminating the traditional separation of concerns employed in the LAMP stack. However, this approach also gives Node unprecedented flexibility as a server.
Node also differs from many other servers in its use of concurrency. A server like Apache maintains a pool of threads for handling client connections. This approach lacks scalability because threads are fairly resource-intensive. Additionally, a busy server quickly consumes all of the available threads; as a result, more threads, which are expensive to create and tear down, are spawned. Node, on the other hand, executes within a single thread. While this may seem like a bad idea, in practice it works well because of the way most server applications work. Normally, a server receives a client request, then performs some high-latency I/O operation such as a file read or database query. During this time the server blocks, waiting for the I/O operation to complete. Instead of sitting idle, the server could be handling more requests or doing other useful work.
In traditional servers, it’s acceptable for a thread to do nothing while blocking on an I/O operation. However, Node has only one thread, and blocking it causes the entire server to hang. To mitigate this problem, Node uses nonblocking I/O almost exclusively. For example, if Node needs to perform a database query, it simply issues the query and then processes something else. When the query finally returns, it triggers an asynchronous callback function that is responsible for processing the query’s results. A pseudocode example of this process is shown in Listing 1-1.
Listing 1-1. Pseudocode Example of a Nonblocking Database Query
var sql = "SELECT * FROM table";
database.query(sql, function(results) {
  // process the results
});
// do something else instead of waiting
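The same pattern can be tried with a real nonblocking call. The following is a minimal sketch rather than one of the book's listings: it uses the core fs module's readdir() function to list the current working directory, and the order array is a hypothetical helper that only records the sequence of events.

```javascript
const fs = require("fs");

// Hypothetical helper: records the order in which things happen.
const order = [];

// Issue the request and return immediately. The callback runs later,
// once the directory listing is available.
fs.readdir(process.cwd(), function (error, entries) {
  if (error) {
    throw error;
  }
  order.push("callback");
  console.log("found " + entries.length + " entries");
  console.log(order.join(" -> ")); // after readdir call -> callback
});

// This line runs before the callback above, because the call did not block.
order.push("after readdir call");
```

The callback always runs after the surrounding code has finished, which is why "after readdir call" is recorded first.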
Node’s nonblocking, asynchronous execution model provides extremely scalable server solutions with minimal overhead. Many high-profile companies, including Microsoft, LinkedIn, Yahoo!, and the retail giant Walmart have taken notice of Node and begun implementing projects with it. For example, LinkedIn migrated its entire mobile stack to Node and “went from running 15 servers with 15 instances (virtual servers) on each physical machine, to just four instances that can handle double the traffic.” Node has also received significant media recognition, such as winning the 2012 InfoWorld Technology of the Year Award.
Installing Node
The first step to getting started with Node is installation. This section will help you get Node up and running on your Ubuntu, OS X, or Windows machine. The simplest way to install Node is via the Install button on the Node home page. This will download the binaries or installer appropriate for your operating system.
Figure 1-1. Installing Node from the project home page
You can also browse all of the platforms’ binaries, installers, and source code on the project’s download page.
Windows users will most likely want to download the Windows Installer ( .msi file), while Mac users should opt for the Mac OS X Installer ( .pkg file). Linux and SunOS users can download binaries, but it is probably simpler to install using a package manager.
Installing via Package Managers
For instructions on installing Node via your operating system’s package manager, consult the project’s online installation guide, which contains instructions for Windows, OS X, and Linux. Again, Windows and Mac users should use the previously discussed installers. As far as Linux is concerned, instructions are available for Gentoo, Debian, Linux Mint, Ubuntu, openSUSE, SLE, Red Hat, Fedora, Arch Linux, FreeBSD, and OpenBSD.
Ubuntu users can install Node and all requisite software using the Advanced Packaging Tool (APT) commands shown in Listing 1-2. These steps also install npm, Node’s package management software (covered in Chapter 2).
Listing 1-2. Installing Node Using Ubuntu’s Package Manager
$ sudo apt-get install python-software-properties python g++ make
$ sudo add-apt-repository ppa:chris-lea/node.js
$ sudo apt-get update
$ sudo apt-get install nodejs npm
If the add-apt-repository command fails, install the software-properties-common package using the command shown in Listing 1-3.
Listing 1-3. Installing the Software-Properties-Common Package
$ sudo apt-get install software-properties-common
Building from Source
If you want to contribute to Node’s C++ core, or simply experiment with its functionality, you will need to compile the project’s source code. You can obtain the source code from the download page, or from the project’s GitHub repository. Once the code is downloaded, extract it from the archive if applicable. Prior to building Node, Ubuntu users need to install Python and other build tools; use the command shown in Listing 1-4. When installing Python, be sure to install version 2.7, not the newer Python 3.
Listing 1-4. Installing Prerequisite Software Packages on Ubuntu
$ sudo apt-get install python-software-properties python g++ make
Ubuntu and OS X users can build Node by issuing the commands shown in Listing 1-5 from within the source code directory. Note that the full path to the source code directory should not contain any spaces.
Listing 1-5. Installing Node from Source on Ubuntu and OS X
./configure
make
sudo make install
On Windows, you need to install Visual C++ and Python 2.7 in order to build Node. Visual C++ can be downloaded for free from Microsoft with Visual Studio Express. Python is also available free of charge from the Python project’s website. To compile Node, issue the command shown in Listing 1-6.
Listing 1-6. Installing Node from Source on Windows
> vcbuild.bat release
Final Installation Steps
No matter which installation route you decided on, by this point Node should be ready to use. To verify that everything is set up correctly, open a new terminal window, and run the node executable (see Listing 1-7). The -v flag causes Node to print the installed version and then exit. In this example, version 0.10.18 of Node is installed.
Listing 1-7. Checking the Version of Node from the Command Line
$ node -v
v0.10.18
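The same version information is also available from inside a running program through the global process object. A minimal sketch follows; the values printed depend entirely on your installation, so no output is shown.

```javascript
// process.version holds the same string that `node -v` prints.
console.log("Node version: " + process.version);

// process.versions lists the versions of bundled components such as V8.
console.log("V8 version: " + process.versions.v8);
```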
You should also verify that npm is installed (see Listing 1-8).
Listing 1-8. Checking the Version of npm from the Command Line
$ npm -v
1.3.8
A final installation note: it’s likely that you’ll need to install Python and a C++ compiler on your machine even if you didn’t install Node from source. Doing this ensures that native modules written in C++ can be compiled and run with your Node installation. On Windows, this involves installing Microsoft’s Visual C++ compiler (see the previous section, “Building from Source”). For any other operating system, the build essentials should include the necessary compiler.
The Read-Eval-Print-Loop
Node provides an interactive shell, known as the Read-Eval-Print-Loop, or REPL. The REPL reads input from the user, evaluates the input as JavaScript code, prints the result, and then waits for more input. The REPL is useful for debugging and for experimenting with small snippets of JavaScript code. To start the REPL, launch Node with no command line arguments. You then see the REPL command prompt, the > character. From the prompt, begin entering arbitrary JavaScript code.
Listing 1-9 shows how to start the REPL and input code. In this example, a variable, named foo, is created with the string value "Hello World!". On the third line, the REPL prints "undefined" because the variable declaration statement returns no value. Next, the statement foo; causes the value of foo to be inspected. As expected, the REPL returns the string "Hello World!". Finally, the value of foo is printed to the terminal using the console.log() function. After foo is printed, the REPL displays "undefined" again, because console.log() returns no value.
Listing 1-9. Starting the REPL and Inputting JavaScript Code
$ node
> var foo = "Hello World!";
undefined
> foo;
'Hello World!'
> console.log(foo);
Hello World!
undefined
You can also enter multiline expressions in the REPL. For example, a for loop has been entered into the REPL in Listing 1-10. The ... is used by the REPL to indicate a multiline expression in progress. Note that ... is displayed by the REPL, not typed by the user.
Listing 1-10. An Example of Executing a Multiline Expression in the REPL
> for (var i = 0; i < 3; i++) {
... console.log(i);
... }
0
1
2
undefined
REPL Features
The REPL has a number of features that increase usability, the most useful of which is the ability to browse previously issued commands using the up and down arrow keys. To terminate any command and return to a blank prompt, type Control+C. Pressing Control+C twice from a blank line causes the REPL to terminate. You can quit the REPL at any time by pressing Control+D. You can use the Tab key to see a list of possible completions to the current command. If there is only one possible option, Node automatically inserts it. The list includes keywords, functions, and variables. For example, Listing 1-11 shows the completion options when t is entered at the prompt.
Listing 1-11. Autocomplete Options Shown by Typing t Followed by Tab
> t
this throw true try typeof tls tty toLocaleString toString
The REPL also provides a special variable, _ (underscore), that always contains the result of the last expression. Listing 1-12 shows several example uses of _. First, an array of strings is created, causing _ to reference the array. The pop() method is then used to remove the last element of the array, baz. Finally, the length of baz is accessed, causing _ to become 3.
Listing 1-12. Example Uses of the _ Variable
> ["foo", "bar", "baz"]
[ 'foo', 'bar', 'baz' ]
> _.pop();
'baz'
> _.length
3
> _
3
.help
The .help command displays all of the available REPL commands. Listing 1-13 shows the output of running the .help command.
Listing 1-13. Output of the .help REPL Command
> .help
.break Sometimes you get stuck, this gets you out
.clear Alias for .break
.exit Exit the repl
.help Show repl options
.load Load JS from a file into the REPL session
.save Save all evaluated commands in this REPL session to a file
.exit
The .exit command terminates the REPL. This command is equivalent to pressing Control+D.
.break
The .break command, used to bail out of a multiline expression, is useful if you make a mistake or simply choose not to complete the expression. Listing 1-14 shows an example of using the .break command to terminate a for loop prior to completion. Notice that the normal > prompt is shown after the .break command.
Listing 1-14. Terminating a Multiline Expression Using the .break Command
> for (var i = 0; i < 10; i++) {
... .break
>
.save filename
The .save command saves the current REPL session to the file specified in filename. If the file does not exist, it is created. If the file does exist, the existing file is overwritten. REPL commands and output are not saved. Listing 1-15 shows an example use of the .save command. In this example, the current session is saved to the file repl-test.js. The resulting contents of repl-test.js are shown in Listing 1-16. Notice that the file does not contain the REPL prompt or output or the .save command.
Listing 1-15. Saving the Current REPL Session Using the .save Command
> var foo = [1, 2, 3];
undefined
> foo.forEach(function(value) {
... console.log(value);
... });
1
2
3
undefined
> .save repl-test.js
Session saved to:repl-test.js
Listing 1-16. The Contents of repl-test.js Generated by the .save Command
var foo = [1, 2, 3];
foo.forEach(function(value) {
console.log(value);
});
.load filename
The .load command executes the JavaScript file specified in filename. The file is executed as if each line were typed directly into the REPL. Listing 1-17 shows the output of loading the file repl-test.js from Listing 1-16.
Listing 1-17. The result of executing repl-test.js, using the .load command
> .load repl-test.js
> var foo = [1, 2, 3];
undefined
> foo.forEach(function(value) {
... console.log(value);
... });
1
2
3
undefined
.clear
Similar to .break, .clear can be used to terminate multiline expressions. .clear is also used to reset the REPL’s context object. At this point, you don’t need to understand the details, but Listing 1-18 shows a Node program that embeds a REPL. In other words, running this program actually invokes an instance of the REPL. Additionally, you can define a custom execution environment for the REPL. In this case, the embedded REPL has a defined variable, foo, that holds the string "Hello REPL". Calling .clear from within the embedded REPL resets the context and deletes foo.
Listing 1-18. Embedding a REPL Within Another Node Program
var repl = require("repl");
repl.start({}).context.foo = "Hello REPL";
Executing Node Programs
Although the REPL environment is useful, it is seldom used in production systems. Instead, programs are written as one or more JavaScript files and then interpreted by Node. The simplest Node program is shown in Listing 1-19. The example simply prints the string "Hello World!" to the console.
Listing 1-19. Source Code for the Node Hello World! Program
console.log("Hello World!");
Copy the code in Listing 1-19 into a new file, and save it as hello.js. Next, open a terminal window, and execute hello.js (see Listing 1-20). Note that Node does not require you to specify the .js file extension. If the input file is not found and no file extension is provided, Node will try adding the extensions .js, .json, and .node. Node interprets .js files as JavaScript source code and files with a .json extension as JavaScript Object Notation (JSON) files. Files with a .node extension are treated as compiled add-on modules.
Listing 1-20. Executing a Node Program from the Command Line
$ node hello.js
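The extension lookup can be observed by running hello.js without its extension. The sketch below is an illustration, not one of the book's listings. It assumes a recent Node release (for mkdtempSync and rmSync), writes hello.js into a temporary directory chosen for the example, and launches a second Node process with the child_process core module, which is covered later in the book.

```javascript
const childProcess = require("child_process");
const fs = require("fs");
const os = require("os");
const path = require("path");

// Create hello.js in a fresh temporary directory (hypothetical location).
const directory = fs.mkdtempSync(path.join(os.tmpdir(), "hello-"));
fs.writeFileSync(
  path.join(directory, "hello.js"),
  'console.log("Hello World!");\n'
);

// Run "node hello" (no extension) from inside that directory.
// process.execPath is the path of the node binary running this script.
const output = childProcess.execFileSync(process.execPath, ["hello"], {
  cwd: directory,
  encoding: "utf8"
});

console.log(output.trim()); // Hello World!

// Remove the temporary directory again.
fs.rmSync(directory, { recursive: true, force: true });
```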
■ Note JSON is a plain text standard for data interchange. This book assumes that the reader is already familiar with JSON. However, if you need an introduction or refresher, JSON is covered in Appendix A.
Summary
Congratulations! You have officially taken the first steps toward developing Node applications. This chapter has given you a high-level introduction to Node and guided you through the installation process. You have even written some Node code using the REPL. The remainder of this book builds on this chapter, covering the most important aspects of Node development. Node is best known for creating scalable web servers, so of course that feature is covered. However, you’ll also learn much more, including file system programming, streaming data, application scaling, and Node’s module system.
CHAPTER 2
The Node Module System
As a developer, you can solve many complex problems using the core Node functionality. However, one of Node’s true strengths is its developer community and abundance of third-party modules. Keeping track of all of these modules is Node’s package manager, npm. The npm FAQ page jokingly states that npm is not an acronym for “Node package manager” and instead is a recursive backronym abbreviation for “npm is not an acronym.” Regardless of its meaning, npm is a command line tool that, since Node version 0.6.3, comes bundled with the Node environment.
What npm does, and does very well, is manage Node modules and their dependencies. At the time of writing, there were over 47,000 packages in the official registry. You can browse all of the available packages at the registry’s site. In addition to each individual module, the site shows various rankings, including which modules are the most popular and which are depended upon the most. If you’d rather get your hands dirty on the command line, you can search the registry using the npm search command, which lets you search for packages based on one or more keywords. For example, npm search can be used to locate all the modules containing the word database in the name or description (see Listing 2-1). The first time you run this command, expect to experience a short delay as npm builds a local index.
Listing 2-1. Using npm search to Locate Modules in the npm Registry
$ npm search database
Installing Packages
In order to use a module, you must install it on your machine. This is normally as simple as downloading a few JavaScript source files (some modules require downloading or compiling binaries as well). To install a package, type npm install, followed by the package name. For example, the commander module provides methods for implementing command line interfaces. To install the latest version of commander, issue the command shown in Listing 2-2.
Listing 2-2. Installing the Latest Version of the commander Package Using npm
$ npm install commander
If you’re not interested in installing the latest version of a package, you can specify a version number. Node modules follow a major.minor.patch versioning scheme. For example, to install commander version 1.0.0, use the command shown in Listing 2-3. The @ character is used to separate the package name from the version.
Listing 2-3. Installing Version 1.0.0 of commander
$ npm install commander@1.0.0
Changes to the major version number can indicate that a module has changed in a non-backwards-compatible way (known as a breaking change). Even changes to the minor version can accidentally introduce breaking changes. Therefore, you’ll typically want to install the latest patch of a certain release, a scenario that npm supports with the x wildcard. The command shown in Listing 2-4 installs the latest patch of version 1.0 of commander. (Note that the x wildcard can also be used in place of the major and minor revisions.)
Listing 2-4. Installing the Latest Patch of commander 1.0
$ npm install commander@1.0.x
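To make the x wildcard concrete, the following sketch shows one way such a pattern could be matched against version strings. It is an illustration only and is not how npm itself is implemented. The function names and the list of published versions are hypothetical, and the list is assumed to be sorted in ascending order.

```javascript
// Split a "major.minor.patch" string into its three parts.
function parseVersion(version) {
  return version.split(".");
}

// Return true if `version` matches `pattern`, where any part of the
// pattern may be the wildcard "x" (for example "1.0.x").
function matchesWildcard(pattern, version) {
  const wanted = parseVersion(pattern);
  const actual = parseVersion(version);

  return wanted.every(function (part, index) {
    return part === "x" || part === actual[index];
  });
}

// Hypothetical list of published versions, in ascending order.
const published = ["0.9.0", "1.0.0", "1.0.4", "1.0.5", "1.1.0"];
const candidates = published.filter(function (version) {
  return matchesWildcard("1.0.x", version);
});

console.log(candidates);                        // [ '1.0.0', '1.0.4', '1.0.5' ]
console.log(candidates[candidates.length - 1]); // 1.0.5
```

Because the list is ascending, the last candidate is the latest patch of the 1.0 release.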
You can also select versions using relational version range descriptors. Relational version range descriptors select the most recent version that matches a given set of criteria. The various relational version range descriptors supported by npm are listed in Table 2-1.
Table 2-1. Relational Version Range Descriptors
Relational Version Range Descriptor    Version Criteria
=version               Exactly matches version.
>version               Greater than version.
>=version              Greater than or equal to version.
<version               Less than version.
<=version              Less than or equal to version.
~version               Greater than or equal to version, but less than the next minor version (or the next major version if only a major version is given).
*                      Newest version available.
""                     Newest version available.
version1 - version2    Greater than or equal to version1, and less than or equal to version2.
range1 || range2       Matches versions specified by either range1 or range2.
Based on Table 2-1, all of the commands in Listing 2-5 are valid npm commands.
Listing 2-5. Various npm install Commands Using Relational Version Range Descriptors
$ npm install commander@"=1.1.0"
$ npm install commander@">1.0.0"
$ npm install commander@"~1.1.0"
$ npm install commander@"*"
$ npm install commander@""
$ npm install commander@">=1.0.0 <1.1.0"
$ npm install commander@"1.0.0 - 1.1.0"
$ npm install commander@"<=1.0.0 || >=1.1.0"
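The comparison operators in Table 2-1 can be illustrated with a small evaluator. This is a simplified sketch under stated assumptions, not npm's actual implementation. It handles only =, >, >=, <, and <= on full major.minor.patch versions, combined with spaces (all must match) and || (any may match). The ~, *, and hyphen forms are deliberately left out, and all function names are hypothetical.

```javascript
// Compare two "major.minor.patch" strings numerically.
// Returns a negative number, zero, or a positive number.
function compareVersions(a, b) {
  const left = a.split(".").map(Number);
  const right = b.split(".").map(Number);
  for (let i = 0; i < 3; i++) {
    if (left[i] !== right[i]) {
      return left[i] - right[i];
    }
  }
  return 0;
}

// Test one comparator such as ">=1.0.0" or "=1.1.0" against a version.
function satisfiesComparator(version, comparator) {
  const match = /^(>=|<=|>|<|=)?(\d+\.\d+\.\d+)$/.exec(comparator);
  if (!match) {
    throw new Error("Unsupported comparator: " + comparator);
  }
  const operator = match[1] || "=";
  const order = compareVersions(version, match[2]);
  switch (operator) {
    case ">": return order > 0;
    case ">=": return order >= 0;
    case "<": return order < 0;
    case "<=": return order <= 0;
    default: return order === 0;
  }
}

// A range is a set of alternatives separated by "||". Within each
// alternative, every space-separated comparator must hold.
function satisfies(version, range) {
  return range.split("||").some(function (alternative) {
    return alternative.trim().split(/\s+/).every(function (comparator) {
      return satisfiesComparator(version, comparator);
    });
  });
}

console.log(satisfies("1.0.5", ">=1.0.0 <1.1.0"));     // true
console.log(satisfies("1.1.0", ">=1.0.0 <1.1.0"));     // false
console.log(satisfies("1.0.5", "<=1.0.0 || >=1.1.0")); // false
console.log(satisfies("2.0.0", "<=1.0.0 || >=1.1.0")); // true
```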
Installing from URLs
In addition, npm allows packages to be installed directly from git URLs. These URLs must take on one of the forms shown in Listing 2-6. In the listing, commit-ish represents a tag, SHA, or branch that can be supplied as an argument to git checkout. Note that the links in the example do not point to any specific git projects.
■ Note You do not need to understand git and GitHub to use Node. However, most Node modules use the GitHub ecosystem for source control and bug tracking. Although GitHub and its use are well outside the scope of this book, it is highly advisable to become familiar with it.
Listing 2-6. git URL Formats Supported by npm
git+ssh://user@hostname:project.git#commit-ish
git+ssh://user@hostname/project.git#commit-ish
git+
Packages can also be installed from tarball URLs. For example, to install the master branch of a GitHub repository, use the syntax shown in Listing 2-7. Though this URL does not point to an actual repository, you can experiment by downloading the commander module:
Listing 2-7. Installing a Tarball from a GitHub Repository
$ npm install
Package Locations
When packages are installed, they are saved somewhere on your local machine. Typically, this location is a subdirectory named node_modules within your current directory. To determine the location, use the command npm root. You can also view all the installed modules using the npm ls command. After installing the commander module, you can verify that it exists using npm ls. For the purposes of this example, install version 1.3.2. Listing 2-8 shows that commander version 1.3.2 is installed. Also, notice that a module named keypress is installed. The tree structure indicates that commander depends on the keypress module. Since npm is able to recognize this dependency, it automatically installs any required modules.
Listing 2-8. Listing All of the Currently Installed Packages Using npm ls
$ npm ls
/home/colin/npm-test
└─┬ commander@1.3.2
  └── [email protected]
You can also see the installed modules by browsing the node_modules subdirectory. In this example, commander is installed in node_modules/commander, and keypress is installed in node_modules/commander/node_modules/keypress. If keypress had any dependencies, they would be installed in yet another node_modules subdirectory under the keypress directory.
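The nested layout described above follows a simple rule: each level of dependency adds another node_modules directory. The sketch below builds the expected install path for a chain of dependencies. The function name is hypothetical, path.posix is used so the output looks the same on every platform, and newer npm releases may flatten this layout, so treat it as an illustration of the scheme described here rather than of every npm version.

```javascript
const path = require("path");

// Build the directory where a dependency chain would be installed
// under the nested layout described in the text.
function nestedModulePath(projectRoot, dependencyChain) {
  return dependencyChain.reduce(function (directory, name) {
    return path.posix.join(directory, "node_modules", name);
  }, projectRoot);
}

console.log(nestedModulePath("/home/colin/npm-test", ["commander"]));
// /home/colin/npm-test/node_modules/commander

console.log(nestedModulePath("/home/colin/npm-test", ["commander", "keypress"]));
// /home/colin/npm-test/node_modules/commander/node_modules/keypress
```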
Global Packages
Packages, as described thus far, are libraries that are included in your program. Referred to as local packages, these must be installed in every project using them. Another type of package, known as a global package, needs to be installed in only one location. Although global packages typically do not include code libraries, they can. As a rule of thumb, global packages normally contain command line tools, which should be included in the PATH environment variable.
To install a package globally, simply issue npm install with the -g or --global option. In fact, you can process global packages by adding the -g option to most npm commands. For example, you can view the installed global packages by issuing the command npm ls -g. You can also locate the global node_modules folder using the npm root -g command.
Linking Packages
Using npm, you can create links to local packages. When you link to a package, it can be referenced as if it were a global package. This is especially useful if you are developing a module and want another project to reference your local copy of the module. Linking is also useful if you want to deploy your module without publishing it to the public npm registry.
Package linking is a two-step process. The first step, creating the link, is done by changing to the directory of the project you want to make linkable. Listing 2-9 shows how to create a link to your module, assuming that your module is located in foo-module. After executing the npm link command, verify that the link was created using npm ls -g.
Listing 2-9. Creating a Link Using npm link
$ cd foo-module
$ npm link
The second step in module linking, actually referencing the link, is very similar to a package installation. First, change to the directory of the project that will import the linked module. Next, issue another npm link command. However, this time you must also specify the linked module’s name. An example of this procedure is shown in Listing 2-10. In the example, the foo-module link from Listing 2-9 is referenced from a second module, bar-module.
Listing 2-10. Referencing an Existing Link Using npm link
$ cd bar-module
$ npm link foo-module
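Under the hood, linking relies on symbolic links. The sketch below imitates the end result by hand, so that the effect on require() is visible. It is a simplification: it assumes a recent Node release, it skips the intermediate global link that npm link creates, and the directory layout and file contents are hypothetical and live in a temporary directory.

```javascript
const fs = require("fs");
const os = require("os");
const path = require("path");

// Hypothetical project layout inside a temporary directory.
const root = fs.mkdtempSync(path.join(os.tmpdir(), "link-demo-"));
const fooModule = path.join(root, "foo-module");
const barModules = path.join(root, "bar-module", "node_modules");

fs.mkdirSync(fooModule);
fs.mkdirSync(barModules, { recursive: true });
fs.writeFileSync(
  path.join(fooModule, "index.js"),
  'module.exports = "hello from foo-module";\n'
);

// Make bar-module/node_modules/foo-module point at the real foo-module
// directory. The "junction" type only matters on Windows.
const link = path.join(barModules, "foo-module");
fs.symlinkSync(fooModule, link, "junction");

// Loading the module through the link reaches the original files.
const loaded = require(link);
console.log(loaded); // hello from foo-module
```

Because the link points at the original directory, any edit made in foo-module is seen by bar-module the next time the module is loaded in a new process.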
Unlinking Packages
The process for removing linked modules is very similar to the process for creating them. To remove a linked module from an application, use the npm unlink command, followed by the name. Listing 2-11 shows the command for removing the linked foo-module from bar-module.
Listing 2-11. Removing a Reference to a Link Using npm unlink
$ cd bar-module
$ npm unlink foo-module
Similarly, to remove a link from your system, change to the linked module’s directory, and issue the npm unlink command. Listing 2-12 shows how to remove the foo-module link.
Listing 2-12. Removing a Linked Module Using npm unlink
$ cd foo-module
$ npm unlink
Updating Packages
Since any package that is actively developed eventually releases a new version, your copy will become outdated. To determine if your copy is out of date, run npm outdated in your project directory (see Listing 2-13). In the example, which assumes that an outdated version 1.0.0 of commander is installed, npm indicates that the latest version is 2.0.0 but that your copy is only 1.0.0. Listing 2-13 checks all of the local packages. You can check individual packages by specifying their names, and you can process global packages by specifying the -g option.
Listing 2-13. Displaying Outdated Packages Using npm outdated
$ npm outdated
npm http GET
npm http 304
commander@2.0.0 node_modules/commander current=1.0.0
To update any outdated local packages, use the npm update command. Much like outdated, update works on all local packages by default. Again, you can target individual modules by specifying their names. You can also update global packages using the -g option. In Listing 2-14, npm updates itself using the -g option.
Listing 2-14. Updating npm Using npm update
$ npm update npm -g
Uninstalling Packages
To remove a package, use either the npm uninstall or npm rm command (the two commands can be used interchangeably), and specify one or more packages to be removed. You can also remove global packages by providing the -g option. Listing 2-15 shows how to remove the commander module using npm rm.
Listing 2-15. Uninstalling commander Using npm rm
$ npm rm commander
The require() Function
As shown in the previous section, Node packages are managed using npm. However, to import modules into your programs, the require() function is used. require() accepts a single argument, a string specifying the module to load. If the specified module path exists, require() returns an object that can be used to interface with the module. If the module cannot be located an exception is thrown. Listing 2-16 shows how the commander module is imported into a program using the require() function.
Listing 2-16. Using the require() Function
var commander = require("commander");
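The two outcomes described above, a returned module object or a thrown exception, can be sketched as follows. This is only an illustration: it loads the core path module so that it runs without any installed packages, and the missing module name is a hypothetical placeholder chosen for the example.

```javascript
// A successful require() returns the object exported by the module.
var path = require("path");
console.log(typeof path.join);

// Hypothetical module name, assumed not to be installed anywhere.
var missingName = "some-module-that-is-not-installed";
var loadError = null;

try {
  require(missingName);
} catch (err) {
  // require() throws when the module cannot be located.
  loadError = err;
}

console.log(loadError.code);
```

In current versions of Node, the error thrown for a module that cannot be located carries the code MODULE_NOT_FOUND, which lets a program distinguish a missing module from an error raised while the module was executing.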
Core Modules
Core modules are modules compiled into the Node binary. They are given the highest precedence by require(), meaning that in the event of a module-naming conflict, the core module is loaded. For example, Node contains a core module named http, which, as the name implies, provides features for working with the Hypertext Transfer Protocol (HTTP). No matter what, a call to require("http") will always load the core http module. As a side note, the core modules are located in the lib directory of the Node source code.
File Modules
File modules are non-core modules loaded from the file system. They can be specified using absolute paths, relative paths, or from the node_modules directory. Module names that begin with a slash (/) are treated as absolute paths. For example, in Listing 2-17, a file module, foo, is loaded using an absolute path.
Listing 2-17. A File Module Import Using an Absolute Path
require("/some/path/foo");
■ Caution Some operating systems such as Windows use a case-insensitive file system. This allows you to write require("commander"), require("COMMANDER"), or require("CoMmAnDeR"). However, on a case-sensitive file system such as Linux, the last two calls would fail. Therefore, you should assume case sensitivity, no matter what operating system you’re using.
Node also supports Windows-style file paths. On Windows, Node allows the slash and backslash characters ( / and \) to be used interchangeably. For the sake of consistency, and to avoid escaping the backslash character, this book primarily uses Unix-style paths. However, be aware that all the paths shown in Listing 2-18 are valid on Windows.
Listing 2-18. Example Module Paths Valid on Windows
require("/some/path/foo");
require("C:/some/path/foo");
require("C:\\some\\path\\foo");
require("\\some/path\\foo");
Module paths that begin with one or two dots (. or ..) are interpreted as relative paths; that is, they are considered relative to the file that called require(). Listing 2-19 shows three examples of relative module paths. In the first example, foo is loaded from the same directory as the calling script. In the second, foo is located in the calling script’s parent directory. In the third, foo is located in a subdirectory, sub, of the calling script’s directory.
Listing 2-19. Example Module Imports Using Relative Paths
require("./foo");
require("../foo");
require("./sub/foo");
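To see that relative paths are resolved against the calling file rather than the current working directory, the following sketch builds a throwaway directory tree and loads a script from it. The directory and file names (app, main.js, and the three foo.js files) are assumptions made for this example only.

```javascript
var fs = require("fs");
var os = require("os");
var path = require("path");

// Build a temporary tree: <root>/foo.js, <root>/app/foo.js, <root>/app/sub/foo.js
var root = fs.mkdtempSync(path.join(os.tmpdir(), "relative-demo-"));
var appDir = path.join(root, "app");
fs.mkdirSync(path.join(appDir, "sub"), { recursive: true });

fs.writeFileSync(path.join(appDir, "foo.js"), 'module.exports = "same directory";');
fs.writeFileSync(path.join(root, "foo.js"), 'module.exports = "parent directory";');
fs.writeFileSync(path.join(appDir, "sub", "foo.js"), 'module.exports = "subdirectory";');

// main.js is the calling script; its relative paths are resolved from app/.
fs.writeFileSync(
  path.join(appDir, "main.js"),
  'module.exports = [require("./foo"), require("../foo"), require("./sub/foo")];'
);

var results = require(path.join(appDir, "main.js"));
console.log(results); // [ 'same directory', 'parent directory', 'subdirectory' ]
```

Each relative path is interpreted from the app directory, where main.js lives, regardless of where the outer script was started.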
If a module path does not correspond to a core module, an absolute path, or a relative path, then Node begins searching in node_modules folders. Node begins with the directory that contains the calling script and appends /node_modules. If the module is not found, Node moves one level up the directory tree, appends /node_modules, and searches again. This pattern is repeated until the module is located or the root of the directory structure is reached. The example in Listing 2-20 assumes that a project is located in /some/path and shows the various node_modules directories that would be searched, in order.
Listing 2-20. Example of the Search Order of node_modules Directories
/some/path/node_modules
/some/node_modules
/node_modules
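The upward walk described above can be approximated in a few lines. This is a simplified sketch, not Node’s actual implementation: nodeModulesSearchPaths is a hypothetical helper, it uses POSIX-style paths only, and it ignores details of the real algorithm such as skipping directories that are themselves named node_modules.

```javascript
var path = require("path");

// Hypothetical helper: list the node_modules directories that would be
// searched, starting at startDir and moving up to the file system root.
function nodeModulesSearchPaths(startDir) {
  var dirs = [];
  var current = path.posix.resolve(startDir);

  while (true) {
    dirs.push(path.posix.join(current, "node_modules"));
    var parent = path.posix.dirname(current);
    if (parent === current) {
      break; // reached the root; dirname("/") is "/"
    }
    current = parent;
  }

  return dirs;
}

console.log(nodeModulesSearchPaths("/some/path"));
// [ '/some/path/node_modules', '/some/node_modules', '/node_modules' ]
```

Inside a running module, the list that Node actually computed for that module is available as module.paths, which is a convenient way to check the search order for a real project.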
File Extension Processing
If require() does not find an exact match, it attempts to add .js, .json, and .node file extensions. As mentioned in Chapter 1, .js files are interpreted as JavaScript source code, .json files are parsed as JSON source, and .node files are treated as compiled add-on modules. If Node is still unable to find a match, an error is thrown.
It is also possible to programmatically add support for additional file extensions using the built-in require.extensions object. Initially, this object contains three keys, .js, .json, and .node. Each key maps to a function that defines how require() imports files of that type. By extending require.extensions, you can customize the behavior of require(). For example, Listing 2-21 extends require.extensions such that .javascript files are treated as .js files.
Listing 2-21. Extending the require.extensions Object to Support Additional File Types
require.extensions[".javascript"] = require.extensions[".js"];
You can even add custom handlers. In Listing 2-22, .javascript files cause require() to print data about the imported file to the console.
Listing 2-22. Adding a Custom Handler to the require.extensions Object
require.extensions[".javascript"] = function() {
  console.log(arguments);
};
■ Caution Though this feature has recently been deprecated, the module system API is locked, so require.extensions is unlikely to ever disappear completely. The official documentation recommends wrapping non-JavaScript modules in another Node program or compiling them to JavaScript a priori.
Resolving a Module Location
If you are interested only in learning where a package is located, use the require.resolve() function, which uses the same mechanism as require() to locate modules. However, instead of actually loading the module, resolve() only returns the path to the module. If the module name passed to resolve() is a core module, the module’s name is returned. If the module is a file module, resolve() returns the module’s file name. If Node cannot locate the specified module, an error is thrown. The example in Listing 2-23 shows usage of resolve() in the REPL environment.
Listing 2-23. Locating the http Module Using require.resolve()
> require.resolve("http");
'http'
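The three cases described above (core module, file module, and missing module) can be exercised together. In this sketch the temporary directory, the foo.js file, and the missing module name are all hypothetical and exist only for the demonstration.

```javascript
var fs = require("fs");
var os = require("os");
var path = require("path");

// Core module: the module's name is returned unchanged.
var coreResult = require.resolve("http");

// File module: create a temporary foo.js, then resolve it without the extension.
var dir = fs.realpathSync(fs.mkdtempSync(path.join(os.tmpdir(), "resolve-demo-")));
fs.writeFileSync(path.join(dir, "foo.js"), "module.exports = 42;");
var fileResult = require.resolve(path.join(dir, "foo"));

// Missing module: resolve() throws, just as require() would.
var missingError = null;
try {
  require.resolve("some-module-that-is-not-installed");
} catch (err) {
  missingError = err;
}

console.log(coreResult);                // http
console.log(path.basename(fileResult)); // foo.js
console.log(missingError.code);         // MODULE_NOT_FOUND
```

Note that resolving foo.js does not execute it; the file is only located, and the .js extension is filled in by the same extension processing that require() uses.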
Module Caching
A file module that is loaded successfully is cached in the require.cache object. Subsequent imports of the same module return the cached object. One caveat is that the resolved module path must be exactly the same. This is so because a module is cached by its resolved path. Therefore, caching becomes a function of both the imported module and the calling script.
Let’s say your program depends on two modules, foo and bar. The first module, foo, has no dependencies, but bar depends on foo. The resulting dependency hierarchy is shown in Listing 2-24. Assuming that each copy of foo resides in its dependent’s node_modules directory, foo is loaded twice. The first load occurs when your program imports foo, which resolves to the your-project/node_modules/foo directory. The second load occurs when foo is referenced from bar and resolves to your-project/node_modules/bar/node_modules/foo.
Listing 2-24. A Dependence Hierarchy Where foo Is Referenced Multiple Times
your-project
├── foo
└─┬ bar
  └── foo
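The effect of caching by resolved path can be observed directly. The sketch below recreates the layout of the hierarchy above in a temporary directory, with two identical copies of a hypothetical foo module, and loads each by its absolute path; the directory names and file contents are assumptions for the example.

```javascript
var fs = require("fs");
var os = require("os");
var path = require("path");

var root = fs.realpathSync(fs.mkdtempSync(path.join(os.tmpdir(), "cache-demo-")));
var topLevelFoo = path.join(root, "node_modules", "foo");
var nestedFoo = path.join(root, "node_modules", "bar", "node_modules", "foo");

// Write the same source to both locations.
[topLevelFoo, nestedFoo].forEach(function (dir) {
  fs.mkdirSync(dir, { recursive: true });
  fs.writeFileSync(path.join(dir, "index.js"), "module.exports = { name: 'foo' };");
});

var first = require(topLevelFoo);
var again = require(topLevelFoo); // same resolved path: served from the cache
var nested = require(nestedFoo);  // different resolved path: loaded separately

// Count the cache entries that belong to the temporary project.
var cachedCopies = Object.keys(require.cache).filter(function (key) {
  return key.indexOf(root) === 0;
});

console.log(first === again);     // true
console.log(first === nested);    // false
console.log(cachedCopies.length); // 2
```

Although both files contain the same code, they produce two distinct module objects, so any state kept inside foo is not shared between your program and bar.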
The package.json File
In an earlier section you saw that npm recognizes dependencies between packages and installs modules accordingly. But how does npm understand the concept of module dependencies? As it turns out, all of the relevant information is stored in a configuration file named package.json, which must be located in your project’s root directory. As the file extension implies, the file must contain valid JSON data. Technically, you do not need to provide a package.json, but your code will essentially be inaccessible to npm without one.
The JSON data in package.json is expected to adhere to a certain schema. Minimally, you must specify a name and version for your package. Without these fields, npm will be unable to process your package. The simplest package.json file possible is shown in Listing 2-25. The package’s name is specified by the name field. The name should uniquely identify your package in the npm registry. By using npm, the name becomes part of a URL, a command line argument, and a directory name. Therefore, names cannot begin with a dot or an underscore and cannot include spaces or any other non-URL-safe characters. Best practice also dictates that names be short and descriptive and not contain “js” or “node”, as these are implied. Also, if you plan to release your package to the general public, verify that the name is available in the npm registry.
Listing 2-25. A Minimal package.json File
{
  "name": "package-name",
  "version": "0.0.0"
}
A package’s version is specified in the version field. The version, when combined with the name, provides a truly unique identifier for a package. The version number specifies the major release, minor release, and patch number, separated by dots (npm allows versions to begin with a v character). You can also specify a build number by appending a tag to the patch number. There are two types of tags, prerelease and postrelease. Postrelease tags increase the version number, while prerelease tags decrease it. A postrelease tag is a hyphen followed by a number. All other tags are prerelease tags. The example in Listing 2-26 shows version tagging in action. Several tagged versions and an untagged version (0.1.2) are listed in descending order.
Listing 2-26. Several Tagged Versions and One Untagged Version Listed in Descending Order
0.1.2-7
0.1.2-7-beta
0.1.2-6
0.1.2
0.1.2beta
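The naming rules and the structure of a version string can be expressed as two small checks. Both functions below are hypothetical helpers written for illustration; they are not part of npm, they cover only the rules stated in this section, and they make no attempt to order tagged versions.

```javascript
// Hypothetical helper: apply the naming rules described in this section.
function isValidPackageName(name) {
  if (typeof name !== "string" || name.length === 0) {
    return false;
  }
  if (name.charAt(0) === "." || name.charAt(0) === "_") {
    return false;
  }
  // Spaces and other non-URL-safe characters change when URL-encoded.
  return encodeURIComponent(name) === name;
}

// Hypothetical helper: split a version into major, minor, patch, and tag.
// An optional leading "v" is accepted; the tag is whatever follows the patch.
function parseVersion(version) {
  var match = /^v?(\d+)\.(\d+)\.(\d+)(.*)$/.exec(version);
  if (!match) {
    return null;
  }
  return {
    major: Number(match[1]),
    minor: Number(match[2]),
    patch: Number(match[3]),
    tag: match[4]
  };
}

console.log(isValidPackageName("package-name")); // true
console.log(isValidPackageName("_private"));     // false
console.log(isValidPackageName("my package"));   // false
console.log(parseVersion("v0.1.2-7-beta"));
// { major: 0, minor: 1, patch: 2, tag: '-7-beta' }
```

For real projects, comparing and sorting versions is better left to npm itself or to a dedicated library, since the exact ordering rules for tags have changed between npm releases.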
Description and Keywords
The description field is used to provide a textual description of your package. Similarly, use the keywords field to provide an array of keywords to further describe your package. Keywords and a description help people discover your package because they are searched by the npm search command. Listing 2-27 shows a package.json excerpt containing description and keywords fields.
Listing 2-27. Specifying a Description and Keywords in the package.json File
"description": "This is a description of the module",
"keywords": [
  "foo",
  "bar",
  "baz"
]
Author and Contributors