Programmer Musings

Review of The Art of Readable Code

2015-09-03T19:50:34Z

The Art of Readable Code
Dustin Boswell and Trevor Foucher
O'Reilly Media, Inc., 2012

When I saw this book, it seemed like a great idea. I've spent my entire career with legacy codebases, trying to make them better. A book that covers ways to make code more readable, and therefore more maintainable, is a wonderful resource.

The book starts with a discussion of small changes that you can make to improve code: naming, aesthetics, and commenting. The authors do a good job of covering what makes a good name and how careful naming can really help the readability of the code. They touch on the lengths of names and manage not to make the mistake of so many by mandating that long names are always best. In the section on aesthetics, they discuss how the look of the code can help with understanding, similar structure conveying similar intent. They even spend a lot of time on making comments pull their own weight and what makes a good comment.

The second section covers structuring control flow and expressions for readability. In the process, they touch on but don't really go into the importance of idioms for making code more recognizable even when you can't read it in depth. Despite that lack, they make a good case for standardizing sections of code into similar forms and simplifying loops and conditionals. Since complicated code structure is hard to read, simplifying these structures improves readability.

The third section moves out of small-scale changes and begins talking about larger restructuring to improve code. In this section, they finally mention some standard forms of refactoring. They also advocate thinking a bit more about what you are trying to accomplish before writing the code. They do not push for full scale BDUF, but instead focus on thinking through subroutines and chunks of code to make certain that you understand what you are doing before writing a bunch of code.

The final major part of the book covers testing and one large case study showing how to apply what you've learned in the book. In the case study, they start with a sub-optimal solution to a problem and make it better and faster while improving its readability. They come very close to implying that the readability is the cause of the performance increase without quite going there.

Overall, I found the book to be quite readable. They did not go as far as I would have in a few areas. They used several programming languages for their examples. For each of those languages, they focused on how to use the idioms of the language to make their point. That handling of languages makes one of my disappointments with the book stand out. In the section on control flow, they make the statement:

Many respected programming languages, as well as Perl, have a do { expression } while (condition) loop.

The only mention of any language that they were not using in their examples is a snarky reference to Perl. I understand that many people really don't like the language, but the random sniping does get old. I was especially amused when they praised the newer languages on their list for features that have been in Perl for decades. That suggests that they may never have programmed in the one language they bashed.

Despite that one piece of snark, I would recommend this book to a junior programmer trying to learn to code. For intermediate or senior level programmers, this book could be a start, but I would expect to go further into idioms, choice of audience, code smells, and other issues relating to high quality code.

Secure Development: Threat Models

2015-09-01T13:55:45Z

There are numerous issues that you need to consider when developing almost any software. If you are working on software that connects to a network in any way, security is yet another thing that you need to consider.

To introduce this series on Designing Secure software, I'm going to talk about something that normally gets left out of discussions about security: threat assessment.

But first, let's go over how security discussions usually play out...

The Attack

Many companies, or even just development groups, don't think about security until one of a few things happens:

They get attacked
One of their customers is attacked (and it might be their fault)
A competitor gets attacked
A big-name company somewhere (Target, etc.) is attacked

In the first two cases, the result is normally yelling at the development staff to find out why they didn't make things secure. The other two cases normally start with an emergency meeting to ask "Are we safe from that?"

In most cases, little to no thought was spared up front for security. Everyone was focused on features, usability, look, and other issues that seem to translate directly into dollars. As usual, hidden issues don't get much attention unless they go wrong.

Once the emergency happens, the powers-that-be want the development staff to smear some security on the system to protect from attack. The big problems with this approach are:

There's no thought about what secure means
There's no thought about what we need to be secure from
It doesn't work

The Secure System

I'll cover the myth that a system can be absolutely secure in a later post. Suffice it to say that if money is no object, a sufficiently powerful and motivated attacker can get into any system.

Identifying Attackers

Which leads to the second point: what kind of attackers are you trying to protect against? To a large extent, the kinds of attackers you expect determine the kinds of attack you are likely to see. Some of the possible attackers you might need to protect against include:

Script kiddies
Bored students
Disgruntled former employees
People with a grudge against the company
People with a grudge against an executive in the company
Competitors
Fledgling hackers practicing their skills
Disreputable companies selling security solutions
Small hacking groups looking for fun or reputation
Large hacking groups with an ideological or political agenda
Foreign companies looking for an advantage
Criminal organizations looking to make money
Nation-funded hackers looking for political advantage
Law enforcement agencies investigating your company
Law enforcement agencies investigating your customers
Three-letter agencies trolling for possible persons of interest

This is not a complete list, but it does a fair job of covering the range of attackers you might face. The kinds of attackers you expect determine what kinds of security you need.

What Applies to You?

If your service serves a community of people who discuss different varieties of carnations, you are not likely to be a target of organized crime, Chinese hackers, or the FBI. There's not much reason for a large hacking group to go after your community. You might be the target of vandalism or someone trying to plant malware on your site, but those attacks are of a much different caliber than a high-end, organized attack.

You need to determine what is important about your site that you need to keep safe. Are you storing passwords, credit card numbers, or personal identification information on your clients? You probably need to be more careful than if you are storing a user-chosen alias. If you are storing serious financial information, you need to be even more secure.

What would happen if someone gained access to all of the information you have on your customers? Would it be embarrassing, a source of financial difficulties, or life-threatening?

What Matters?

Let's look at a few scenarios to see how you might make some decisions.

Carnation Community

Let's start with our example from the last section. Say you have a site that supports a group of flower enthusiasts chatting about carnations. What you have determines who might attack and how.

If you have a username and password for each user to allow them to post, there are a small number of possible attacks. The first is malware or malicious links. Anywhere your users are allowed to post something that you can display will need some level of protection from this threat. But this applies to almost all sites. The important part is what is special about your site.

Given the known breaches that have released large numbers of passwords, your passwords are a target. Since, in this example, there is no email address or real name associated with the account, the passwords are only mildly valuable.

If your passwords are not stored in the clear, the biggest threat to your system is people hacking in and posting something that harms the reputation of a user in your system.
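As a minimal sketch of what "not stored in the clear" might look like, the following keeps only a random salt and a derived digest for each user. It assumes Python's standard library PBKDF2 function; the iteration count and function names are illustrative assumptions, not a recommendation from this post.

```python
import hashlib
import hmac
import os
from typing import Tuple

# Assumed work factor for illustration only; tune it for your own hardware.
ITERATIONS = 200_000


def hash_password(password: str) -> Tuple[bytes, bytes]:
    """Return (salt, digest) to store; the plaintext password is never kept."""
    salt = os.urandom(16)  # fresh random salt for every user
    digest = hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt, ITERATIONS
    )
    return salt, digest


def verify_password(password: str, salt: bytes, expected: bytes) -> bool:
    """Recompute the digest from the attempt and compare in constant time."""
    candidate = hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt, ITERATIONS
    )
    return hmac.compare_digest(candidate, expected)
```

With something like this in place, a stolen user table exposes salts and digests rather than passwords, which is why the remaining threat in this scenario is mostly someone posting as another user.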

This would probably not attract the attention of anyone with a large amount of resources. Although, you might have to watch out for the sunflower hacking squad. They might want to deface your pages as part of their ongoing campaign to take the most popular flower spot.

A Bit More Tempting

Let's say we increase the amount of information on the carnation chat site. In addition to the username and password, let's say you include contact information: real name, physical address, and email address. You use this to send personalized offers for flower shops.

This makes the passwords more valuable (many people have one email address and use the same password on multiple sites). The email address is always of interest to spammers. The physical address and real name together give more information for potential identity theft.

Notice how a small amount of information significantly increases the potential threats.

The Carnation Store

Say the site has added the ability to buy carnations and have them shipped to you. If you store credit cards, you have just become the target of larger groups. Organized crime, larger hacking groups, and small-timers trying to make a reputation will all be interested. You now have something that translates directly to cash.
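The three scenarios above can be restated as a rough lookup from the data you store to the threats it attracts. This is only a sketch: the field names and the function are hypothetical, and the mapping contains nothing beyond what the scenarios already say.

```python
def likely_threats(stored_fields):
    """Return a sorted list of threats suggested by the kinds of data stored.

    The field names are hypothetical labels for this illustration.
    """
    fields = set(stored_fields)
    threats = set()
    if "password" in fields:
        # Someone logging in as a user and harming their reputation.
        threats.add("posting as another user")
    if {"password", "email"} <= fields:
        # Many people reuse one email address and password across sites.
        threats.add("password reuse on other sites")
    if "email" in fields:
        threats.add("spam")
    if {"real_name", "physical_address"} <= fields:
        threats.add("identity theft")
    if "credit_card" in fields:
        # Data that translates directly to cash attracts larger groups.
        threats.update({
            "organized crime",
            "larger hacking groups",
            "small-timers seeking a reputation",
        })
    return sorted(threats)
```

Calling `likely_threats(["username", "password"])` describes the original carnation community, while adding `"credit_card"` to the list pulls in the larger groups from the store scenario.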

Summary

The more advanced the attacker, the more effort and expense will be needed to secure the software. Identifying who is likely to attack your software allows you to provide reasonable security without spending too much effort.

What you have and what you do determine who would be interested in attacking you. Identifying potential threats determines how much effort you need to put into securing your (and your users') information.

Putting too much effort into security could cause a project to miss a deadline or fail completely.

BPGB: (Dis-)Integration Branches

2015-08-25T13:46:25Z

This is another post in my intermittent series of Best Practices Gone Bad (BPGB).

Today, we are going to take another side-step into version control. Most development groups use version control of some form. Whether you prefer Subversion, Git, Mercurial, Bazaar, ClearCase, or any of the many others, version control is an important technique for keeping your changes under control. This is especially true if you are maintaining multiple releases concurrently or have more than a couple of developers on your team.

Back in BPGB: Feature Branch Fail, we covered what happens when branches live for too long. When you need to merge multiple long-running branches, you increase the probability of conflicts.

Integration Branch

In order to prevent these kinds of conflicts from messing up the main branch, many people discover the idea of an integration branch. You branch from the main line, merge multiple feature or bug fix branches into this integration branch, and fix any conflicts there. When the integration branch is clean and survives the tests, you merge the branch back to main.

This approach seems pretty reasonable and usually solves the first set of problems that people have with branch conflicts. Although the conflicts still exist, the integration branch gives us the time to resolve conflicts without leaving the main branch broken. If the conflicts take time to merge, we don't have main in a broken state. If the conflicts are too overwhelming, we have an easy way to back out. Maybe merging branches in a different order will make the conflict resolution easier. In any case, we have a few more options. Life is good.

Then, someone has the idea that recreating the integration branch each time we want to do this is a waste. The obvious approach is to leave the integration branch around and just keep it in sync with main. Although this seems reasonable, we have just turned the integration branch into the equivalent of a long-lived feature branch. As we found in the previously mentioned post, this tends to result in worse conflicts and pain.

If the integration branch is not kept in sync with main, there is a real possibility of problems when integration is merged to main. I've also seen situations where someone decides that the integration branch is obviously more up-to-date and overwrites (force pushes) the main branch, potentially losing changes that had already been merged. This becomes the same kind of issue that we were trying to solve with the integration branch in the first place.

Key to making the integration branch strategy work is that this branch starts out identical to your main branch before you begin merging. If there is any difference at all, you court the possibility of doing a bunch of work to get the integration branch functional, only to have the same problems again when you merge to the main line.
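One way to enforce that rule is a small check that runs before any merging starts. The sketch below assumes Git and assumes branches named main and integration; both names, and the helper functions, are hypothetical. It compares the commits the two branches point to and reports whether they are identical.

```python
import subprocess


def commit_of(ref, repo="."):
    """Resolve a branch name to its commit hash using `git rev-parse`."""
    result = subprocess.run(
        ["git", "rev-parse", ref],
        cwd=repo,
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


def integration_matches_main(repo=".", main="main",
                             integration="integration", resolve=commit_of):
    """Return True only if both branches point at exactly the same commit.

    `resolve` is injectable so the check can be exercised without a repository.
    """
    return resolve(main, repo) == resolve(integration, repo)
```

If the check fails, the safer choice is to delete the integration branch and recreate it from main rather than trying to bring the old one back in sync.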

Summary

Most version control tools provide methods for maintaining and merging multiple lines of development. Despite the fact that the tools have become increasingly good at recognizing and resolving simple conflicts, human intervention may still be required. Care is needed to make sure that you reduce the effort needed to make changes rather than just move the effort.

One real anti-pattern for version control is long-lived branches. There are a few cases where they make sense, but those cases are a lot rarer than people believe. Don't ever make a long-term integration branch solely to save the time of setting up and tearing down this branch as needed. The pain will quickly outweigh the minor benefit.

Review of Release It!

2015-08-13T20:40:30Z

Release It!
Michael T. Nygard
Pragmatic Bookshelf, 2007

I've had this book on my shelf for a few years, and finally got some time to start reading it. I should not have waited.

Nygard takes the position that the life of a piece of software actually only begins when it is released. He spends a lot of time on the way things go wrong in production that you won't see in a development environment. Anyone who has ever survived a push to release and been surprised that there's no time to relax will really appreciate this book.

The book contains a large number of war stories showing things going wrong in real projects. Some of the failures are obvious, others are surprising. Nygard distills these failures down to some anti-patterns that can cause problems with stability or capacity. Then, he provides design patterns that can mitigate or eliminate some of the problems in release.

Some of these design patterns seem obvious: Use Timeouts or Pool Connections. Others are less familiar: Circuit Breaker or Bulkheads. Like the patterns from the Gang of Four book, a large part of the benefit of the patterns is having common names and descriptions of the patterns. This applies whether you have been using them for years or have seen them for the first time. If you are seeing them for the first time, his descriptions are good enough that you should quickly understand the pattern.
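For readers who have not met Circuit Breaker before, here is a minimal sketch of the general idea. It is not Nygard's implementation; the class name, parameters, and thresholds are assumptions made for illustration. After a number of consecutive failures, the breaker refuses calls for a cool-down period instead of letting every caller wait on a dependency that is already failing.

```python
import time


class CircuitOpenError(Exception):
    """Raised when a call is refused because the circuit is open."""


class CircuitBreaker:
    def __init__(self, max_failures=3, reset_after=30.0, clock=time.monotonic):
        self.max_failures = max_failures  # consecutive failures before opening
        self.reset_after = reset_after    # cool-down in seconds
        self.clock = clock                # injectable for testing
        self.failures = 0
        self.opened_at = None             # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after:
                raise CircuitOpenError("circuit open; call refused")
            # Cool-down has elapsed: let one trial call through (half-open).
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.opened_at is not None or self.failures >= self.max_failures:
                # Open the circuit, or re-open it after a failed trial call.
                self.opened_at = self.clock()
            raise
        # A success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

A caller would wrap each remote call, for example `breaker.call(fetch_prices, symbol)` where `fetch_prices` is a hypothetical function, and treat `CircuitOpenError` as a signal to fail fast or fall back.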

The section on general design principles is very effective, but the part that most developers really need to read is the section on Operations. Too often, those of us developing software forget what the operations people need from the software. When we think of them at all, we try to provide a nice interface for a few admin tasks. Nygard points out that a pretty interface is nowhere near as useful as a scripting or command line interface. He also describes examples of how this kind of approach actually helps operations.

Overall, this is a really great book for anyone that is releasing software that must run in a production environment. If you fall in this camp, you should get the book and read it before your next project.

LCDC: Rising Tide

2015-08-12T00:04:01Z

I began this series on Least Common Denominator Code (LCDC) with The Myth of Code Anyone Can Read. In the posts that followed I attempted to show how this kind of code is not actually possible in a real development shop. I've shown how different code should be written for different audiences, and how real business value drives code that cannot be written for the least common denominator.

In this post, I propose to show why, even if it were possible, LCDC would be a bad idea.

Expertise

Different developers have different kinds and levels of expertise. Unless you only have one developer on staff, you have probably seen that some developers are more productive, or faster, or safer than others. You may have one developer that always works on the critical code because she is the most careful or security conscious. A different developer may be tasked with time-critical issues, because he can always get a 90% solution in place in short order, even if other developers are needed to finish the work.

These developers have different skills, expertise, and temperaments that determine how the company can best make use of them.

However, you don't want the developers on your team sitting still. The rest of the industry is continually learning new tools and techniques. They are exploring other ways of solving problems. If your competitors are improving and your team is not, your company will fall behind. This could cause the project (or company) to fail.

On a related note, the kind of people who become software developers usually like to learn. They are probably spending some time looking into new technologies and related fields. The good ones will be exploring on their own to improve their skills. If one of your good developers finds an interesting tool or technique, they are going to want to use it.

On the other hand, there are developers who just code as a job. Just like any field, there is a spectrum of skill and motivation. These are the people that may just be doing the same thing year after year. As a manager of mine used to say:

There's a big difference between 5 years of experience and 1 year of experience 5 times.

If you want to keep up with or surpass your competition, you need more of the former than the latter.

Developer Motivation

Many people have written about what motivates software developers.

In most of these discussions, you will find some variant of interesting problems and opportunity to learn or improve. The best developers seek out positions with these features. They also like to be recognized by other developers for their expertise. LCDC fights against all three of these motivational factors. If a really good developer has few or no interesting challenges, is not allowed to learn more effective ways to solve problems, and cannot show off new techniques to the other developers, their main motivation is to find a new job.

I have heard managers argue in the past against training for their developers, because they might learn enough to want to leave. One trainer I met answered with "What if your developers don't learn anything, and then stay?" As I pointed out above, developers outside your company are learning. If your team is not, it is falling behind.

The Rising Tide

The open source community is fond of the phrase:

A rising tide raises all boats.

The idea is that if we share our expertise the whole industry gets smarter and more capable. You can take advantage of this inside your company or team as well.

Instead of aiming to keep your code written to the level of your most junior developer, aim for just above the middle level of expertise on your team. Anyone below that level should be encouraged to learn. The developers with more expertise will be looking for ways to improve the overall team ability, so that they can show off their skills.

The downside of this approach is that new people will have a lot to learn. But, the really good developers like to learn. That's a large portion of why they are in this field. The upside is that the developers who want to learn and improve will stay with the team. The ones who refuse to learn will fall behind (and might leave). The overall result is a more capable, more experienced team.

Summary

The point is that LCDC is quite likely impossible. Every bit of experience in your code fights against the idea of LCDC. The more you look, the more likely it is to be bad for your project. Not only is this a bad idea from a technical point of view, but it also tends to chase off your best developers.

This whole series of posts aims to convince you that LCDC is a really bad idea.

LCDC: Different Audiences Have Different Needs

2015-08-03T03:00:12Z

In the last few posts, beginning with The Myth of Code Anyone Can Read, I've focused on what you can expect from your programmers in general. Of course, generalizing is what got us into this discussion in the first place, so let's spend a little time not generalizing.

When teaching new programmers, I always tell them that the code has at least two audiences: the computer and the next programmer. This is not actually true; depending on the purpose of the code, it may have many audiences. For instance,

Code used as an example to show how to use an API may be used by almost any level of developer. They may not have any experience with your system and may be relatively junior.
Code used as examples for writing plugins for your system can expect that the reader has some level of familiarity with the way your system works, if only at a superficial level. Since plugins normally have a tighter integration with your system than an API call, the developer can be expected to learn a little more before doing development.
Most of the code in any given system is non-critical. If non-critical code breaks, it is annoying and possibly embarrassing, but it won't result in loss of data or compromise of a customer. Since it is inside the system, any developer working on it will have at least the level of expertise that the manager hires for. If every new hire is expected to have three years development experience, the code can assume that level of experience.
Core business code is the part that makes the money. The programmers that are working on that code should be at least the level of experience you hire for, and probably more. They should be familiar with the business and terminology, so that they can be expected to understand the jargon.
Core library code is the foundation upon which all of your system sits. Some of this will be libraries that you have gotten from a third party and just use. Some will be the low-level routines written by your senior developers and used by the rest of the team. This code requires deep knowledge of the domain of that library. The only people working on it will be experts in that particular domain. Only the most senior people should be making changes in this code.

Levels of Developer

Let's start with a fundamental rule:

Expertise is not evenly distributed.

This rule is why programmers (and people in general) are not interchangeable. You cannot expect to replace the senior programmer, who has 5 years of experience in your code base, with an entry-level programmer straight out of a computer science degree and expect the same level of productivity.

I have had some people suggest that this is an elitist approach to development. As a really new programmer, I would have been insulted by someone suggesting that I wasn't qualified to work on a particular piece of code. In actuality, this is really more of a pragmatic view of development than an elitist one.

The truth is that developers in the real world have different levels of experience, along many different dimensions. Some are experts in the language, but know nothing about your business. Some know about your business and have never written more than a 10 line program. Most are somewhere in-between.

API User

Let's take the first category. People who use our APIs are completely outside our control. We have no knowledge of who they are or what they have worked on. For this reason, we pretty much have to assume LCDC as a requirement. This is as close to "random person walking in off the street" as you should get. Anyone who has ever supported an API has stories of people misusing the API for many reasons, including:

Using the API for the wrong business
Trying to perform an action that is not valid for the industry
Using the API to violate the laws of physics
Trying to use the API from an unsupported language
Trying to use the API with unsupported hardware

My favorite example of this was a system I supported that displayed charts of stock price information. A customer using our interface reported a bug: no matter what he did, he could not get the chart to show data later than today. It took a bit of effort to explain the impossibility of displaying tomorrow's stock price.

You can assume nothing about a user of your external API.

Plugin Developer

Even if the plugin developer may start near the same level as someone who uses your API, they will inevitably become more knowledgeable before they get very far into the development of a plugin. By their nature, most plugins require some understanding of the system. Most of the time, someone has used the program itself and possibly any API for a while before deciding that a plugin is the way to go.

Since developing a plugin requires more expertise and knowledge, it serves as kind of a filter on the kind of developer who will attempt it. By structuring your plugin interface for more experienced developers, you can further filter for developers with the kinds of skills you are willing to support.

Business Code

The next two categories of code will mostly only be seen by your developers. Most of this code is not business critical. A large amount of any code base is normal input code, validation, data-massaging, and output. Support for templates, language, user interface, etc. are important, but may not harm your customers in the event of a mistake.

Most businesses, however, have some code that is core to the business. Some examples include code that:

makes the money (charging credit cards, making trades, finding oil, etc.)
handles the consistency of the user's data
handles legal requirements
prevents unauthorized access to private user data (credit card number, etc.)

That kind of critical code is not immediately turned over to junior developers, because mistakes in those areas could mean failure of the project (or lawsuits or loss of income). The more critical the code is to your business, the more likely you are to have it done or, at least, overseen by more senior people. This is simply because your most experienced people are more likely to be aware of the issues that are important in this code.

That Code

Many systems have a piece of code that is core to their business, but is so involved, low-level, or just plain weird that only a few are trusted to touch it. This code normally lives in libraries that any of your developers can use, but few understand. Sometimes it's the result of someone determined to have job security because they are the only one who understands the code. Most of the time the reason is more benign. This code could be:

the result of someone's PhD thesis and only a handful of people on the planet understand it
the system that ensures that data is maintained according to the laws governing your industry
the trade secret that allows your code to do something your competitors can't
the interface code to a piece of hardware required by your system
the registration code that makes certain your company is paid for every copy in use

In most cases, this code is important and a small number of developers have become the experts with that code over time. (Hopefully, that small number is not 1 or less.)

This kind of code often has special jargon all its own that no one new to the system is going to understand. There is really no need to make this code generally understandable. You do not want a junior developer making changes here, since there is a large probability that they are going to break it.

Different Styles for Different Audiences

Different portions of the code assume different levels of expertise from their audience. Trying to write all of the code to the least common denominator will make parts of the code unfit for the designed purpose. An approach that cannot assume some knowledge on the part of the developer will be harder to maintain for people who do have that knowledge.

Recognizing the audience for a particular piece of code will allow your team to choose an appropriate level at which to write the code, instead of mandating a bad idea. One other side effect is that someone wandering into code that uses more advanced idioms that they don't understand has an indication that they should probably know more before making changes here. It's kind of a "you must be this tall to ride the ride" sort of marker.

If the code is written as if any idiot can change it, likely one will. This approach only works if your developers are comfortable with the idea that code may be written with different idioms and that the idiom is an indication of expertise needed. It also works better if anyone is allowed to read the code, but only allowed to change it once they have convinced the current maintainers that they have the appropriate understanding.

Summary

Different kinds of code require different levels of expertise. Writing everything as LCDC hides useful markers of different kinds of code. This could tempt less experienced developers into breaking critical code because they touched something they didn't understand.

In the final post of this series, I'll show that this is not a static situation.

LCDC: Business Logic

2015-07-29T14:12:43Z

In The Myth of Code Anyone Can Read, I introduced the idea that least common denominator code (LCDC) is not a goal anyone should aim for. Despite my assertion, I've seen a number of places where this is a requirement.

Even if you don't believe any of the other reasons for code to not cater to the lowest common denominator, there is one reason that you can't really ignore.

Business-specific Knowledge

Your particular company or project embodies knowledge that is not universally known. If it didn't, your code would not be worth much, because anyone could reproduce it on demand. As such, a novice walking in off the street will never understand all of your code. They don't have the context to understand it. Often, the logic behind the business decisions that the code needs to make is written into your system. There are functions, objects, and algorithms that are particular to your business. There are particular ways of using these fundamental parts to do work that has value to your users.

Terminology

All of this means that there are idioms, terminology, and approaches in your code that novices or really junior people won't understand. Some of this terminology may qualify as jargon. This just means that there are terms that mean something particular in your business that people from another industry or possibly even company may not understand. One of these jargon terms may be a short-cut to an entire paragraph of context. Novices or people from outside your company are likely not to recognize any jargon used inside the code.

If you try to re-write the code so that they can, it will become a morass of low-level details that make any higher-level understanding impossible. Randall Munroe gave us a wonderful example of what happens when you don't use specialized terms in xkcd: Up Goer Five.

Summary

Obviously, you want any code written for your project to be specialized for your business. You expect that, over time, your developers will learn more about your business. Obviously, that knowledge should be reflected in the code. These insights into your business are what make your developers able to write code specific to your business.

Next time, we'll explore a specific case where LCDC might be useful for parts of your project.

LCDC: Developer Specialization

2015-07-02T14:10:52Z

In The Myth of Code Anyone Can Read, I introduced the idea that least common denominator code (LCDC) is not a good approach to writing software. Among other things, this approach ignores programmer specialization that happens in any team of more than two developers.

Mental Caching

As a project becomes larger than one person can comfortably keep in their head, maintenance becomes more difficult. A developer has to spend some time re-familiarizing themselves with part of the code before they can do any serious work on it.

Depending on how recently they looked at the code, the developer could take almost no time or several hours re-learning the important parts of the code. (If you think the latter is unreasonable, I've been in code bases where we worked on a bug in code that had not been touched in almost 10 years. It takes a while to get up to speed on that.)

Since most programmers don't want to waste time, they tend to specialize somewhat in the code that they work on. This means that they can work on anything in that part of the code without spending a lot of time remembering or re-learning it. This happens naturally. After finishing working on a bug or feature in the foo subsystem, they look for another bug or feature in the same system. After all, they are already familiar with it.

As this continues, that developer becomes the fastest person to work on that subsystem, and so bugs and features in that area tend to go to them, naturally.

The Specialist

When you think of development specialists, you probably think of someone like a database or UI developer. But other specializations tend to happen pretty naturally. Watch who works on bugs in which area. See who people ask when they need to work in an area they don't know. Does everyone defer to Nancy on network issues? Is Fred the expert on the logging system? Do we put off working on the reporting framework when Robert is on vacation?

These people have become specialists. They are normally most effective in their area of specialization. Some may still be pretty good in other areas.

Effects of Specialization

As someone becomes more and more familiar with one area of the code, they tend to develop short-cuts and patterns that reduce potential bugs and make the developer more efficient. They may develop a specific metaphor that only applies to this part of the code. They develop jargon terms that supply a lot of context when they see them in the code. These are part of what makes them effective.

Forcing the specialist to write code in a way that everyone can understand would severely limit their effectiveness. They would have no shortcuts to remind them of the patterns they have seen in code. Instead of the code becoming tighter and more focused, it would tend to be more unfocused.

You might think I'm being overly pessimistic here. But I have actually seen this happen. When a new person takes over the code after a specialist leaves (or is promoted), they tend to undo all of the specializations that they don't understand. The code normally slows down and develops new bugs.

This points to one of the major problems with having specialists. They reduce the project's Bus factor. This problem is relatively easy to solve. Have at least one other developer work with your specialist some of the time in their code to learn the context before it is critical. The second developer does not need to become as much of an expert in that part of the code, but they should be able to understand the thinking that went into it.

Summary

If you have more than one developer on your team, specialization is pretty much inevitable. In general, that is a good thing. Deeper knowledge in a particular area is likely to produce serious benefits over the life of the project. Along with that deeper knowledge will come patterns and idioms that are very specific to the target code. Mandating these patterns and idioms away will destroy the benefits of having a specialist.

In the next post, I'll discuss a kind of specialization that is critical to your project and that you can't mandate away.

LCDC: Library Code

2015-06-24T14:25:25Z

In LCDC: Fundamental Knowledge, I explained how hard it is to specify a minimum level of knowledge or experience for all programmers. This minimum level would be needed to determine what is allowable for Lowest Common Denominator Code (LCDC). Anyone who has been programming for any time is probably shouting at the screen, calling me an idiot, because programmers don't really need to know the internals of some of this stuff. We can rely on well-written libraries to handle the hard parts.

I'm going to look at this from two different directions.

Libraries Without Understanding

The problem with assuming that a library hides all of the hard bits is that no library is a perfect abstraction. In some cases, you can ignore the internals. In others, the fundamental properties of the library are more evident.

Misuse of Hashes

In recent years, I've been doing quite a bit of Perl programming. In Perl, as in most of the dynamic languages, one of the fundamental data types is a hash, which is implemented as a hash table. To make sure we are on the same page (because I can't know your background), the following is a list of the important characteristics of a Perl hash.

Consists of strings as keys, each with an associated scalar as its value
Given a string, access to the associated value does not depend on the size of the hash (constant time access)
Checking for the existence of a key in the hash is also a constant time operation

I have regularly seen a pattern in code where a programmer wants to see if a string exists in a large array of strings. So, they use the following approach:

build a hash from the array of strings
check for existence of the string in the hash
discard the hash

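They reason that looking up a string in a hash is fast, so this is a good idea. Unfortunately, this is actually slower than doing a straightforward linear search of the array. Building the hash has to visit (and hash) every string in the array before the single fast lookup can happen, while a linear search visits each string at most once and can stop at the first match. If the programmer understood the way hashes work (and a little bit about algorithmic complexity), they never would have made this mistake.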
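The difference is easier to see with the work spelled out. What follows is a minimal sketch in C rather than Perl, and everything in it is an assumption made for illustration: the function names, the visited counter, the djb2-style hash, and the open-addressing table are hypothetical stand-ins, not how Perl implements its hashes.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Linear search: stops at the first match, so it examines at most n strings.
   *visited reports how many array elements were examined. */
int contains_linear(const char **items, size_t n, const char *needle,
                    size_t *visited)
{
    assert(visited != NULL);
    *visited = 0;
    for (size_t i = 0; i < n; i++) {
        (*visited)++;
        if (strcmp(items[i], needle) == 0)
            return 1;
    }
    return 0;
}

/* Simple string hash (djb2 style); any reasonable hash would do here. */
static size_t hash_string(const char *s)
{
    size_t h = 5381;
    while (*s)
        h = h * 33 + (unsigned char)*s++;
    return h;
}

/* The pattern from the text: build a throwaway table, do one lookup,
   discard the table. Returns 1 if found, 0 if not, -1 if allocation fails. */
int contains_via_table(const char **items, size_t n, const char *needle,
                       size_t *visited)
{
    assert(visited != NULL);
    size_t slots = 2 * n + 1;            /* keeps the table under half full */
    const char **table = calloc(slots, sizeof *table);
    int found = 0;

    *visited = 0;
    if (table == NULL)
        return -1;

    for (size_t i = 0; i < n; i++) {     /* build: always touches all n strings */
        size_t pos = hash_string(items[i]) % slots;
        while (table[pos] != NULL && strcmp(table[pos], items[i]) != 0)
            pos = (pos + 1) % slots;     /* linear probing */
        table[pos] = items[i];
        (*visited)++;
    }

    size_t pos = hash_string(needle) % slots;   /* the single "fast" lookup */
    while (table[pos] != NULL) {
        if (strcmp(table[pos], needle) == 0) {
            found = 1;
            break;
        }
        pos = (pos + 1) % slots;
    }

    free(table);                         /* discard the table */
    return found;
}
```

In this sketch, the table version always touches every string once (plus an allocation and a hash computation per string) before it can answer, while the linear version never examines more than n strings and usually fewer. The table only pays for itself if it is kept around and queried many times.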

Random Sorting

In multiple languages, I've seen people try to randomize an array by calling the standard library's sort function with a comparison function that calls the standard rand function. Without knowing how sort works under the hood, you may not realize that this can result in anything from a poorly shuffled array to a run that doesn't terminate. (In really unusual cases, it could result in modifying memory outside the array.)
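For contrast, here is a minimal sketch in C of one well-known way to randomize an array without abusing sort: the Fisher-Yates shuffle. The function name shuffle_ints is hypothetical, and using rand() with a modulus is an assumption made to keep the sketch short; it carries a slight bias that a careful implementation would avoid.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Fisher-Yates shuffle: walk from the end of the array, swapping each
   element with a randomly chosen element at or before it. The loop runs
   n - 1 times for n >= 2, so it always terminates. */
void shuffle_ints(int *values, size_t n)
{
    assert(values != NULL || n == 0);
    for (size_t i = n; i > 1; i--) {
        /* rand() % i has a slight modulo bias; acceptable for a sketch. */
        size_t j = (size_t)rand() % i;   /* 0 <= j < i */
        int tmp = values[i - 1];
        values[i - 1] = values[j];
        values[j] = tmp;
    }
}
```

Unlike a sort driven by a random comparison, this never asks the sorting algorithm to cope with answers that contradict each other, so it cannot loop forever or index outside the array.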

C String Functions

A large number of security holes have been caused by misuse of the C standard library functions strcat and strcpy. Some people blame the language for not being robust. Another way to look at it is that people are using the library without understanding how it works.
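As an illustration of what understanding the library looks like here, the following is a minimal sketch of an append that respects the size of the destination buffer. The name append_bounded and its return convention are hypothetical; snprintf is standard C and is used only for its bounded write and its "would have written" return value.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Append src to the nul-terminated string already in dst without writing
   past dst_size bytes. Returns 1 if everything fit, 0 if it was truncated. */
int append_bounded(char *dst, size_t dst_size, const char *src)
{
    assert(dst != NULL && src != NULL && dst_size > 0);

    size_t used = strlen(dst);           /* dst must already be terminated */
    if (used >= dst_size)
        return 0;                        /* no room, not even for the nul */

    /* snprintf writes at most (dst_size - used - 1) characters plus a nul
       and returns the length it would have written with unlimited space. */
    int needed = snprintf(dst + used, dst_size - used, "%s", src);
    return needed >= 0 && (size_t)needed < dst_size - used;
}
```

The point is not that this particular helper is the right one for every code base. The point is that strcat and strcpy assume the caller has already done this arithmetic, and the security holes come from callers who did not know that.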

Terminated C Strings

One last example dates from early in my programming career. I found the following line in a C program.


     str[strlen(str)] = '\0';

In fact, this same idiom was repeated in many places in the code. It turns out that the programmer had come to C from another language. When learning C, he had read that every C string must be terminated with a nul character. He intended this to set the character after the end of the string to nul. Unfortunately, he didn't realize that strlen works by looking for the nul. This makes the line an expensive no-op.
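To see why, it helps to look at what a simple strlen does. This is a minimal sketch, not the code from any particular C library (real implementations are usually far more optimized), and my_strlen is a hypothetical name.

```c
#include <assert.h>
#include <stddef.h>

/* A straightforward strlen: count characters until the terminating nul.
   If the string is not already terminated, this reads past the end. */
size_t my_strlen(const char *str)
{
    assert(str != NULL);
    size_t len = 0;
    while (str[len] != '\0')
        len++;
    return len;
}
```

Because the loop only stops when it finds a nul, str[my_strlen(str)] is by definition already the nul character. Writing another nul there changes nothing, and if the string really were unterminated, the length computation itself would be the bug.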

The more complex the library, the more likely that some programmer will not understand it. This means that hiding complicated code by putting it in libraries may not solve your problem.

Project Libraries

Let's say that somehow we could argue that the library solution would actually make complicated algorithms and data structures usable for everyone. Shouldn't that same argument apply to your project's code? Shouldn't your programmers be able to write a set of code to wrap up complicated logic and make it usable to the entry level people?

If the library is well designed, with good abstractions, and documented very well, it can definitely abstract away some of the complex problems in the code. This approach makes the code easier to understand and maintain for junior programmers.

The problem, of course, is that you can't use a library to encapsulate knowledge and still write the internals of the library without the need for that knowledge. In general, the critical functionality of the code is usually entrusted to the more senior people. They must understand the internals, in order to write the library code. So, at a minimum, the library itself cannot be LCDC.

Summary

Libraries are not a panacea for the LCDC problem. Programmers can find ways to misuse libraries if they don't understand the algorithms and assumptions used by the library. Moreover, if libraries could solve the problem, then your project should be able to use the same approach by hiding knowledge in libraries. But, that violates the LCDC assumption because the library cannot be written without that knowledge.

In the next post, we'll start looking at a way to get rid of the LCDC assumption.

For the rest of the posts in this series, check out The Myth of Code Anyone Can Read.

LCDC: Fundamental Knowledge

2015-06-19T14:25:37Z

In The Myth of Code Anyone Can Read, I introduced the idea that least common denominator code (LCDC) is not a good approach to writing software. One reason is the knowledge base of your average programmer.

Different Programmers Have Different Backgrounds

Programming is still a relatively new field. It's also a pretty broad field. A person claiming to be a programmer or software engineer could have learned their craft in any of several ways:

Self-taught: on-line tutorials, books, ongoing self study
Computer science degree
Management Information Systems degree
Programming course in a different degree program
Programming boot camp
Internship at a programming shop

Each of these can result in either really good or not-so-good programming skills. In addition, the terms programming and software development can also be applied in very different areas.

Embedded systems
Hardware driver development
Scientific software
Website development
SCADA software
Financial software
Game development
Graphics programming
Smart phone app development
Automotive software
High availability software
... and many more

Each of these different areas has a very different idea of what knowledge and skills are fundamental. You can't necessarily take a website developer and have them be productive on an embedded systems project. You might not want a game developer working on software for pacemakers.

Given different backgrounds, specifying a minimum level of knowledge becomes much harder.

Data Structures

Let's start simple. If we want to write LCDC, we can't use any data structures that aren't understood by everyone. So, we can probably guess that most people would understand arrays. That is pretty fundamental. What about others[1]:

Linked lists
Binary trees: basic binary, AVL, or red-black trees
Generalized trees: tries, suffix trees, octrees, B-trees, R-trees
Graphs: DAGs, spanning trees
Stacks
Queues: FIFO, deques, priority queues
Hash tables, associative arrays, dictionaries
Heaps

Most programmers in my experience are not familiar with many of the data structures above, much less all of them. Some of these data structures underlie programming tools we use every day. Others are more specialized. Some are extremely well-known in one industry or company and virtually unknown in others.

If we really want LCDC, these data structures and the advantages they give would be unavailable. After all, most programmers don't know how a red-black tree or hash table works, so how can we write code that uses them?

Fundamental Algorithms

Data structures aren't the only fundamentals that we can't rely on everyone understanding. Many of the algorithms that we depend on are opaque to the average developer.[2]

Sorting: quicksort, insertion sort, heap sort, merge sort
Security: SHA-256, AES, Diffie-Hellman key exchange, cipher block chaining, HMAC
Graphics: JPEG compression, ray tracing, Bézier curves
Databases: SQL, document databases, object databases, hierarchical databases
Randomness: Fisher-Yates shuffle, Mersenne Twister, entropy pools
String manipulation: regular expressions, longest common subsequence, Hamming distance, Levenshtein distance, KMP algorithm
Graphs: Dijkstra's algorithm, alpha-beta pruning, topological sort

In some fields, each of these algorithms is commonly used. In others, each is completely unknown. Even in the fields where a particular algorithm is used, most developers probably don't understand all of the algorithms used in that field. According to the LCDC premise, we cannot use any algorithm that not everyone can understand.

Summary

Because of the breadth of the programming field and the many different ways that individuals came to work in the field, it is very hard to describe a subset of knowledge that we can claim is known by everyone.

Not all of these apply to every business, but most programs end up touching one or more of these areas somewhere. Our code would be slower, less correct, and harder to maintain without being able to take advantage of well-known and well-tested algorithms, even if they are beyond the grasp of your most junior people.

In the next post, I'll explore using libraries to solve this problem. We'll also see how they would be impacted by the LCDC idea.

Notes

[1] Apologies if I've left out your favorite data structure. I just wanted a list big enough to get the point across.
[2] Since there are even more algorithms than data structures, this is an even more incomplete list. On the other hand, I suspect that more of these will be unknown to more programmers.

The Myth of Code Anyone Can Read

2015-06-18T14:40:35Z

I got into a conversation recently coming out of the Houston.pm user group meeting. As usual, we wandered over numerous technical topics, but one stuck out in my mind: whether or not to use more advanced or more complicated language idioms.

I've written about programming idioms and advanced code many times in the past (see below). Part of the reason for revisiting this topic repeatedly is a mindset that I have seen throughout my career. The idea is to write the code so that anyone can read it. Although this sounds reasonable at first, lowest common denominator code (LCDC) almost always results in a hard-to-maintain code base.

The Failure of LCDC

There are a number of reasons for this simple idea to fall apart. The most obvious comes from your experience of reading text in a human language. If we wanted to keep the text at a level that anyone could read, everything would need to be written at a first grade level. That's really the lowest level that you can claim that someone can read.

In human languages, text is written at different levels depending on the context and expected audience. Why would you expect programming to be different? Over the next few entries, I plan to cover some of the context that would change the way code should be written.

In particular, I plan to show different places where writing LCDC would harm the project and, possibly, your business.

References

BPGB: Readable Code

2015-05-22T14:54:14Z

As you probably know, code is read more often than it is written. Anybody who has worked on code written by anyone else has probably wished that the code were more readable at some point.

Writing readable code should definitely be considered a best practice. The problem comes when defining what you mean by readable.

The Clever Coder

The situation that causes a readable code mandate is normally one person, or a small number of people, writing code that is hard for the team to read. Let's call this person the Clever Coder. This programmer loves obscure features of the language and uses them whenever he can. He is likely to use unusual algorithms with the justification that they are more efficient or robust without proving those assertions. His code often contains obscure references or jokes that are only familiar to a small number of people. These references rarely have any connection with the actual purpose of the code.

At some point, someone else has to deal with this guy's code, and the resulting complaints (and probably bugs from obscure interactions of changes) cause someone in authority to mandate, "Thou shalt write readable code."

How to Define Readable

So who determines what is readable code? If you define readable as code that your current team considers readable, this mandate is not too bad. If you have code that must be understandable to people outside your team, you could get input from them.

Unfortunately, since the problem is normally caused by someone clever writing code that's completely obtuse, the usual counter response is to go too far in the other direction: the code should be readable by as many people as possible.

The Novice Assumption

At this point, someone usually decides to approach readability by one of two measures.

Can the manager read it?
Can some hypothetical new programmer read it?

If the manager was recently a tech type on this code base, her insight might still be good. If not, we have what is effectively a non-native speaker determining the readability of the code. As you can guess, this may not turn out well.

The problem with the hypothetical new programmer is that this usually devolves quickly into an assumption of someone completely unfamiliar with the code and the language. The result is an assumption that the code must be readable by anyone who just walks in the door. This kind of lowest common denominator standard guarantees verbose and awkward code.

Lowest Common Denominator

The worst versions of this I have seen involve mandating that the code should be perfectly clear to someone who is not even familiar with the language. (If that seems silly, I agree. But, I have seen that requirement before.)

This results in code that is horrible from the viewpoint of someone with any level of experience in the language. It's usually unnecessarily verbose, with every little thing spelled out in excruciating detail. This view is often pushed by the people with the least experience in the language or code base or by managers who once programmed in a completely different environment.

The Self-Fulfilling Prophecy

One result of a code base written to be readable by the least experienced person we can imagine is that the only people who will be willing to work on it are those people with the least experience. Your more senior developers will avoid this code if possible. If they can't avoid the code, they will leave for places where they can work more effectively. Before long, your code will be maintained by people who are only comfortable with code where everything is spelled out in detail.

Now, all code must be written in this style because the programmers that remain can only work this way.

The Trade-off

I've actually written several posts on this topic in the past. Choosing the audience you are using for defining readability has a strong effect on who will be willing and able to work on your code. Failing to realize that different parts of your code may have different audiences means that everything must be written to the lowest common denominator.

Aiming for a simpler standard of readability guarantees that more people can read it in the same way that drivel on TV guarantees a wider audience. Aiming for a more advanced level of readability reduces the number of people who can read the code, but that audience will necessarily be more advanced. Readability becomes a filter. On the opposite side of the lowest common denominator, we are back with the original example of code that can only be read by an audience of one. Like everything else in programming, readability involves trade-offs.

Conclusion

Readability becomes a BPGB when we don't think about the audience we are targeting. As usual, best practices must be applied thoughtfully to actually be best practices.

The Best Practices Gone Bad series contains more explorations of best practices taken to unfortunate conclusions.

BPGB: YAGNI Overdone

2015-04-30T14:05:54Z

One of the design ideas that came out of the early days of the agile movement was YAGNI. As I have written before, this idea is a push back against the tendency of many programmers to over-engineer or over-complicate our designs. We normally use some variation of the flexibility argument to justify this tendency. In fact, it's often driven more by being impressed with our own cleverness.

Many very experienced programmers have always been aware of the need to simplify designs. But, not all of us were that experienced. In the early days of agile, the YAGNI principle was new enough that it made a good shorthand for less experienced programmers to learn to rein in their design exuberance.

A Good Idea Taken Too Far

As with any idea, eventually people begin to apply the principle without the supporting experience or history and the result is dogma. I have seen people create overly simplistic designs in the name of YAGNI, even though a slightly more complex design is obviously needed.

An Example: Database

For example, let's say that we are going to build a system for tracking patrons to a library, including their book borrowing history. For any library larger than a single shelf, we are obviously going to need to use a database of some sort.

For the YAGNI fanatic, it might seem reasonable to start with a flat file and only change to an RDBMS when we actually have enough records to matter. They would then push back against the RDBMS solution as long as we could possibly do the work in a flat file.

The obvious downside is that a large amount of code will be needed to implement features for the flat file that are already supported by the database. This means that we will write extra code that, in the best of cases, will be thrown away as soon as we change to the right technology. Worse, the system might end up depending on some of the side effects of the simplistic solution that are hard to reproduce with the better solution. Now we'll need extra work to fix the better solution.

An Example: Templates

If you are working on a web-based system, you learn quickly that much of your output will be wrapped in boilerplate HTML for display to a user. Most languages support at least one template library for merging data and text to generate output. People that have never used a template library, or have had a bad experience with one, often decide that templates are a YAGNI issue. After all, what could be easier than just writing text to the output or a file?

The result of YAGNIing the use of templates is usually one of two approaches. The first ties the output tightly to the actual code doing the work to the point that changing the output may modify the functionality of the code. The other approach ends up growing an ad hoc, crippled, half-way templating system that is specific to this code base.

Conclusion

In both of the examples above, the issue revolves around not thinking ahead. The YAGNI principle was intended to avoid thinking too far ahead. It was never intended to keep you from thinking ahead at all.

In the library example, if you are writing code for a library of a dozen books and five people, a flat file approach is probably fine. Moreover, the problem may not be worth the overhead of running a database server.

In the template example, if you are writing a single page webapp that is a simple report, the time to learn a templating system may not be worthwhile.

However, if either example moves beyond the most trivial case, the more advanced solution is not a frivolous complication. In both cases, You Really Are Going To Need It. Misapplying YAGNI in these cases just generates pain down the road.

BPGB: The Witch Hunt

2015-03-01T04:25:31Z

For the next entry in the Best Practices Gone Bad series of posts, I have a topic I wish I had thought of.

In issue 125 of Overload magazine (Feb 2015), Sergey Ignatchenko wrote an article entitled Best Practices vs Witch Hunts. Sergey covers a somewhat different approach to how best practices go bad. He walks through a series of steps leading from identifying a practice that a few teams find useful, through declaration as a best practice, to an end point of applying the practice religiously without regard for whether it is appropriate.

One thing I don't think I have described well in my posts is the point that best practices are trade-offs, just like everything else we do in development. Since any best practice is really a rule of thumb that works in many, but not all circumstances, you can't enforce a best practice without thought. Sergey does a good job of getting this concept across in his article.

Many of the best practices gone bad that I've described (and more that I intend to) only become a problem when someone uses the practice in a place where it is not the best trade-off. Sergey covers how people can become zealots enforcing a best practice to the point that it becomes a BPGB. He also touches on the social aspects of this kind of failure. I definitely recommend the article.

BPGB: Feature Branch Fail

2015-02-14T18:27:34Z

The past few generations of version control systems have really good support for branching. This feature allows someone to create a new line of development that is separate from the official main/master/trunk line of development. Changes made on a branch do not affect the main development until they are merged to the main branch.

Shortly after people become comfortable with branches, they notice that branches can solve the problem of big changes breaking the master branch.

Feature Branches, In Theory

One workflow used to keep the master branch clean is Feature Branches. The idea is to create a new branch for each bug fix, change, or new feature that you want to work on. As long as the change is not complete, you continue making changes on your feature branch. When everything is working satisfactorily, you merge the feature branch back into the master branch.

For small changes, this works incredibly well. Different people can work on their own tasks independently of each other until they are ready to merge. Nobody else's changes affect your work until you are ready to merge. Everything seems wonderful and bright.

The First Cracks

The first problems in this perfect workflow occur when two people make overlapping or conflicting changes. The second person to merge gets a merge conflict and/or failing tests. This is actually not that much different than the problem with doing all of the changes on the master branch. Modern DVCSs usually handle the textual conflict pretty well. A good test suite will usually uncover semantic conflicts pretty quickly.

This means that no one tends to see this as much of a problem.

Long-running Feature Branches

Feature branches work pretty well for the short term, but what happens as the branches live for longer?

The longer the feature branch stays disconnected from the master branch, the more potential conflicts build up. The advantage of having changes to the master branch not affect your feature branch comes at a cost of merge pain later. Doing regular merges from the master branch into the feature branch can mitigate this problem at the cost of more frequent, smaller potential pain points.

Just as importantly, the longer the feature branch runs, the more changes it collects relative to the master branch. This means that when the feature branch is finally merged, it will probably cause disruption to anyone else working on master or their own feature branches.

Positive Feedback

Probably the worst part of this problem is an annoying feedback loop caused by merging. As long as merges go well, people are willing to do them regularly. When a merge is conflicted, however, the developer may find they have an aversion to doing frequent merges. After all, if any merge could be painful, we probably want to reduce the number of opportunities for pain.

Unfortunately, the longer you go between merges, the more likely you are to have merge conflicts (all else being equal). This reinforces your idea that merges are painful and makes you more likely to put them off. This positive feedback loop can pretty rapidly create worst case feature branch merges.

Dependency on a Feature Branch

Another fun failure mode for feature branches is when someone creates a feature branch that depends on an unfinished feature branch. This shouldn't happen often, but it becomes more likely the longer a feature branch lives.

As an example, let's say that Bob is working on a long-running feature branch that replaces the logging system used by our server. Sue starts a new feature to add failover capabilities to the server, allowing us to have more than one server at a time. In working through the problem, Sue realizes that Bob's new logging would make some of her work simpler. She merges Bob's changes into her feature branch after talking with Bob to learn how the feature can help.

The problem occurs when Sue finishes before Bob. She merges her code into the master branch and suddenly Bob's unfinished code is part of the system. This partial code now ripples into every feature that is keeping up with master. If things go well, Bob can finish his feature soon and merge it in. If his code really isn't ready for general use, then everyone is at least partially broken for a while.

Conclusion

Short-lived feature branches are a good idea. The longer a feature branch lives, the more effort is needed to keep potential problems in check. People who are new to this workflow often get stuck on the idea of the independence of the branches and don't realize the problems lying under the surface.