A way to ...
The word testing means different things to different people. In many cases, testing is just a means to an end. When we are talking about testing in Perl, we are actually talking more about a development philosophy.
Exercise the code by hand to see if it breaks
There is nothing wrong with ad hoc testing, unless it is the only form of testing you use. This approach obviously requires a lot of person-hours to perform, and it is not well suited to verifying code repeatedly. The work is highly repetitious, which generally means error-prone when performed by humans.
Have the computer do the boring work.
Most of the testing we want to talk about is automated. When the procedure is to run the same test day after day, you really want to hand the test off to a computer. This makes the test more reproducible and, quite frankly, the computer does not get bored and wander off topic.
Each stakeholder wants different things from any testing. Good automated testing can actually provide all of them, at less expense than you might assume.
There are many Perl testing frameworks.
Test::Class
Test::Unit
Moose::Test
Test::FIT
Test::Cukes
Like many things in Perl, there are many different frameworks that provide different approaches to solving the problem. Since our time is short, I won't focus on these.
We will focus on the most widely used and standardized system in the Perl ecosystem: TAP.
1..4
ok 1 - Initial sanity verified
ok 2 - Sanity still exists
not ok 3 - Sanity has left the building
# Failed test 'Sanity has left the building'
# at examples/sanity.t line 11.
# got: '4'
# expected: '5'
ok 4 - Sanity has been restored
# Looks like you failed 1 test of 4.
This is a very simple example of the output from a TAP-based test. It shows all of the major parts of a TAP test.
#!/usr/bin/perl
use Test::More tests => 4;
use strict;
use warnings;
my $var = 1;
is( $var, 1, 'Initial sanity verified' );
like( $var, qr/^\d+$/, 'Sanity still exists' );
is( 2+2, 5, 'Sanity has left the building' );
isnt( 2+2, 5, 'Sanity has been restored' );
The standard Test::More module provides a more complete set of assertion functions. If the assertion succeeds, these don't really do much more than the plain ok() function (which is also provided). But, if the assertion fails, they provide more useful diagnostic output.
ok() - success if first argument is true
is() and isnt() - equality
like() and unlike() - regex match
cmp_ok() - arbitrary binary comparison
is_deeply() - compare arbitrary data structures
isa_ok() and can_ok() - objects and classes
These are the major test assertion functions provided by Test::More. The module does provide other functionality, but we will not be exploring that in an introduction.
Perl has a lot of test modules that work with TAP
Test::Differences
Test::Deep
Test::LongString
Test::Exception or Test::Fatal
Test::Warn
Test::NoWarnings
You are not limited to Test::More. There are a large number of specialized modules that provide other assertions and extend functionality. All of these still output TAP, so they are compatible with any tools that recognize that output.
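As one example, Test::Fatal (a CPAN module, not part of core Perl) adds an exception() helper that captures whatever a block of code dies with, so exception behavior can be checked with the ordinary Test::More assertions. The risky() routine and its error message are hypothetical:

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Test::More tests => 2;
use Test::Fatal;    # CPAN module providing exception()

sub risky { die "out of cheese\n" }

# exception() returns what the block died with, or undef if it lived.
like( exception { risky() }, qr/out of cheese/, 'risky() throws as expected' );
is( exception { 1 + 1 }, undef, 'plain arithmetic does not throw' );
```

Because the output is still TAP, this file runs under prove exactly like a plain Test::More script.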
You still might ask why you should test. Technical solutions are great, but why should you apply them?
Testing cannot find all bugs, but not testing cannot find any bugs.
Much has been written on the inability of testing to find all of the bugs in software. Likewise, testing cannot prove that the code is correct and bug-free. But, testing does provide a link in a chain of evidence.
And, not testing provides even less evidence of correctness.
Well-written tests show how the module should be used.
I have worked in environments where the documentation group was able to answer a lot of their own questions from well-written tests. That meant that they only needed to interrupt the development team for the questions that really mattered. This is a better use of time for both teams.
Writing tests makes you use an interface.
Nothing makes you appreciate the awkwardness of your interface like forcing you to use it.
Any bug that is covered by a unit test will not come back
(without you knowing).
If you add a unit test for each bug that you find, the bug cannot reappear due to later changes in the code without triggering the test.
The old Extreme Programming methodology used the first approach to decide what should be tested.
Most people remember to test the obvious success case. It is equally important to look for places on the edges of good behavior or changes in behavior and test around those areas as well. Edges are a rich source of bugs.
Fewer developers remember to test the failure cases. Do you know how the code acts if it is passed a bad parameter? How about if a file it depends on does not exist? How about permission failures? Testing these areas makes certain that code is in place to handle the sorts of surprises that happen in the real world. These corners of the code are often never exercised except by unit tests, so the first time they execute may be in production.
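A sketch of what testing a failure case looks like, using only core Perl; the divide() routine and its error message are invented for the example:

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Test::More tests => 3;

# Hypothetical function with an error path worth testing.
sub divide {
    my ($num, $den) = @_;
    die "division by zero\n" if $den == 0;
    return $num / $den;
}

is( divide( 10, 2 ), 5, 'obvious success case' );

# The failure case: make sure the error path actually runs and reports.
eval { divide( 1, 0 ) };
like( $@, qr/division by zero/, 'bad input is reported, not ignored' );

# A missing file is another failure worth probing.
ok( !open( my $fh, '<', '/no/such/file' ), 'open fails on a missing file' );
```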
Almost nobody remembers to test their assumptions. These assertions almost never fail. When they do, they tend to generate very strange behavior that may take days to unravel.
Look at some test code.
The supplied code is not production code. List.pm is an example that is complex enough to have reasonable tests and simple enough that you won't be distracted by the problem being solved. See List::Util for the real solution to this problem.
The code is pretty straightforward. The first test file shows some minimal testing that someone might apply. The final test is somewhat more complete.
Test::Tutorial
Test::More documentation
The images in the slides are from the Open Clip Art Library.
The Devel::Cover module helps determine what code is executed. For example, see the List.pm unit test coverage.
The confidence given by your unit tests can only be as good as your coverage. Using Devel::Cover you can measure the degree of confidence you should have in your unit tests.
In a real system, different modules will have different levels of coverage and, therefore, different levels of confidence.
prove command
make test
./Build test
All Test::More-based test files are Perl scripts that can be executed directly. Using the prove command supports running multiple test scripts at a time. More importantly, prove summarizes the tests in a much nicer fashion.
Sometimes you have test assertions that you do not want to run in some circumstances. For example, if you have a multi-platform module, the MS Windows-specific tests don't need to run under Mac OS X. Test::More provides a nice mechanism for skipping tests: the skipped tests carry a message telling why, and the run still completes in a way that lets prove give a good summary.
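The skipping mechanism is the SKIP block. A sketch, where the Windows-specific checks are invented placeholders:

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Test::More tests => 3;

ok( 1, 'platform-neutral behavior' );

SKIP: {
    # Skip the next 2 assertions (with a reason) unless we are on Windows;
    # prove reports the skips in its summary rather than counting failures.
    skip 'MS Windows-only tests', 2 unless $^O eq 'MSWin32';

    ok( -d 'C:\\Windows',            'Windows directory exists' );
    ok( defined $ENV{USERPROFILE},   'USERPROFILE is set' );
}
```

The count passed to skip() must match the number of assertions skipped, so the plan still adds up.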
TODO tests are great for adding a test for something that you know needs to be done, but that you cannot fix right this moment. You could leave a failing test, but keeping your tests always passing is a really good habit. The TODO mechanism allows the test to fail, but reports it as a success for the test run. The message reminds you what is happening so you don't forget. Just as importantly, prove gives a special indication when a TODO test starts passing unexpectedly.
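A TODO test is an ordinary assertion inside a block with a local $TODO message. The frobnicate() routine here is a hypothetical, not-yet-written function:

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Test::More tests => 2;

ok( 1, 'existing behavior still works' );

TODO: {
    local $TODO = 'frobnicate() not implemented yet';

    # This assertion fails today, but the run still reports overall success;
    # prove flags the test specially if it ever starts passing.
    ok( defined &main::frobnicate, 'frobnicate() exists' );
}
```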
On rare occasions, you can find yourself in a state where no further testing can happen (hardware to test is missing, no hard drive, etc.). Rather than fail all of the tests one at a time, Test::More provides the ability to stop a full test run by bailing out.
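Bailing out is done with BAIL_OUT(), which aborts the entire run (including any remaining test files under prove) with the given message. The precondition below is a stand-in; $^X is Perl's own interpreter path, so this sketch never actually bails:

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Test::More tests => 2;

# Hypothetical precondition: if it fails, nothing further can succeed,
# so abort the whole run instead of failing every test in turn.
BAIL_OUT('perl interpreter path unknown') unless $^X;

ok( 1, 'precondition satisfied' );
ok( 1, 'normal testing continues' );
```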
A good choice for testing a Web UI is WWW::Selenium. See Testing with WWW::Selenium at the Houston.pm website for a talk we had on this subject.
To answer a great question from the talk: UI testing was not in the scope of this talk, but WWW::Selenium would be a good place to look for tools to solve this problem.
The convention is to have a t/ subdirectory and put all of your tests there. Some people put them under tests/, some dump them in the main directory. Organizing them is a better idea in the long run, but the important thing is to have them and run them.
There are many kinds of testing. The list above gives some terms that are worth learning about. All but the last two are automatable.