January « 2007 « Cowboy Programming

January 4, 2007

Programming Poker AI

Filed under: Game Development,Inner Product — Mick West @ 6:17 pm

This article was originally published in the “Inner Product” column in Game Developer Magazine, November 2005

I recently programmed the AI for the World Series of Poker, developed by Left Field Productions and published by Activision. I started out thinking it would be an easy task. But it proved a lot more complex than I initially thought.

This article for the budding poker AI programmer provides a foundation for a simple implementation of No-Limit Texas Holdem Poker AI, covering the basics of hand strength evaluation and betting. By following the recipe set out here, you will quickly become able to implement a reasonably strong poker AI, and have a solid foundation on which to build. I assume you are familiar with the basic terminology of poker.

TEXAS HOLDEM

The goal of any game playing AI is twofold. The primary purpose is to allow the player to have a fun and enjoyable experience. The secondary purpose, subordinate to the first, is to play a strong enough game to provide sufficient challenge to the majority of players in your intended audience.

POKER DATA TYPES

You will need an implementation of the following data types. I’m going to describe them at the bit/byte implementation level, leaving the high level abstraction up to you.

A “suit” is an integer in the range 0..3, where 0=Clubs, 1=Diamonds, 2=Hearts, 3=Spades

A “rank” is an integer in the range 0..12, where 0 = 2 (deuce), 1 = 3, 11 = King, 12 = Ace. This is the cards in a suit arranged in rank order

A “card” is an integer in the range 0..51, hence
card = suit*13 + rank.
Suit = card/13
Rank = card%13

A “Hand” is a 52 bit data type, where each bit represents a single card. This can be stored as four 16 bit words for ease of use, where each 16 bit word represents the potential cards in one suit (using 13 of the 16 bits) (figure 1)

A “Hand Type” is an integer representing the type of poker hand you have, where 0= no pair, 1=pair, 2=two pair, 3=trips, 4=straight, 5=flush, 6=full house, 7=quads, 8=straight flush.

ENCODING HAND VALUES

A “Hand Value” is a 32 bit integer representing the relative value or strength of any hand of cards. By comparing two hand values, you can see which hand is stronger in a game of poker.
The hand value can conveniently be represented as a series of six 4-bit nibbles, where the most significant nibble represents the Hand Type, then the next five nibbles represent the different ranks of the cards in the order of significance to the hand value. (figure. 2)

Example 1: AH QD 4S KH 8C is a “no pair” hand type (sometimes called a “high card” , or in this case “Ace high” ). So, the hand type nibble is set to 0. The remaining nibbles in the Hand Value are filled out with the ranks of the five cards in descending order. (A, K, Q, 8, 4), which translated into rank indices: 12,11,10,6,2 (or C,B,A,6,2 in hexadecimal), and when combined with the hand type (0) in the high nibble, gives us a 32 bit integer: 0x000CBA62.

The individual suits of the cards are basically ignored in the final hand value. The only time suit is significant is when it contributes to a flush. Also, note the top two nibbles of the Hand Value are always zero.

Example 2: 4D JD 3D 4C AD is a pair of fours, with Ace, Jack, Three kickers. The hand type is a pair, (type 1), then the ranks follow, starting with the rank of the pair, then the ranks of the kickers, so 4,A,J,3, which gives us 0x0012C910.

Example 3: 7C, 6C, 5C, 4C, 3D is a straight (type 4). More specifically it’s a seven high straight. The only rank of import here is the seven (rank 5). So the hand value is encoded as 0x00450000. We save ourselves a bunch of instructions in ignoring the four low cards after we’ve determined it is a straight.

Look at the resultant hand values of the above examples, you can clearly see how the better hands always have a higher hand value, making determining the wining hand a simple comparison.

CALCULATING HAND VALUES

What we now need is a function that takes a hand, and returns a hand value. This involves determining the hand type, then inserting the nibbles for the hand ranks, as above.

A hand is four words (clubs, diamonds, hearts, spades) of 13 bits each. 13 bits can be arranged in just 8192 combination, which means we can accelerate the evaluation of a hand by pre-calculating 8K tables of things like the number of bits set in a (13 bit) word (if you have five or more of the same suit, then you’ve got a flush), or the highest card of any straight in the hand. You can also pre-calculate a table of the highest five cards from a particular bit combination, which you can then use to set the kicker cards.

If you calculate ranks = (hearts | diamonds | clubs | spades) then the value ranks is a bit-field with a bit set for every card rank that you have at least one of. The number of bits set here is the number of unique ranks you have. We calculate the number of bits in each of hearts, diamonds, clubs and spades, and subtract the number of bits in the unique ranks, giving the number of duplicated ranks, to be used as the basis of determining what type of hand you have.

Example: if you have 2D AS AH 2C 2H, you can very quickly determine that you have five cards, that there are just two unique ranks, and hence you must have either a full house or four of a kind. A few more simple tests will determine exactly what you have. The entire evaluation function will consist of tests like this, gradually whittling down the possible hand types.

Since the function consists mostly of bitwise operations, table lookups and simple comparisons, it is going to be very fast. It’s also very amenable to fine tuning optimization, and the exact implementation will depend on the target architecture. You may be able to take advantage of some processor specific instructions to greatly improve the efficiency.

CALCULATING HAND STRENGTH

Hand strength is the probability that you will win the hand, given your hole cards, the community cards, and the opponents who remain in the hand. Hand strength is a floating point number between 0.0 (certain loss) and 1.0 (certain win). For example, a HS of 0.33 means you have a 33% chance of winning.

The easiest and most flexibly way of calculating the HS is to simulate the progress of the game a very large number of time, and count the number of those times you win. Say you simulate the game 1,000 times, and in the simulation, you win 423 games, then you have a high degree of certainty of having an approximate HS of 423/1000, or 0.423.

The procedure for simulating a game is very simple:

Create a pack of cards
Set score = 0
Remove the known cards (your hole cards, and any community cards)
Repeat 1000 times (or more, depending on CPU resources and desired accuracy)
Shuffle the remaining pack
Deal your opponent’s hole cards, and the remaining community cards
Evaluate all hands, and see who has the best hands
If you have the best hand then
Add 1/(number of people with the same hand value) to your score (usually 1)
End if
end repeat
Hand Strength = score/number of loops (1000 in this case).

To be more accurate, we have to run our simulation with people dropping out if they are dealt hole cards below a certain threshold. In practice, the determination of if a player stays in or not in a simulation is a probabilistic function of the strength of their hole cards, their table position, their stack size, the blind size and their previous behavior. For now we can just modify the simulation, so after dealing the opponents hole cards, remove any non-blind players with hole cards worse than, say, a pair of sixes. While not particularly elegant, it will still give you a useful number.

POT ODDS

The pot odds number is the ratio of your bet or call to the size of the pot after you bet (the amount you will win). For example, if the bet is $20, and there is $40 in the pot, then the pot odds are 20/(20+40) = 0.333.

RATE OF RETURN

Rate of return is the “on average” proportion of how much you will multiply your bet by, if you stay in the hand.

Rate of Return = Hand Strength / Pot Odds.

The base strategy we implement is to mostly stay in hands with a rate of return greater than 1.

THE FOLD/CALL/RAISE DECISION

For each round of betting the computer needs to decide if it is going to fold, call or raise (The FCR decision). Ignoring the question for the moment of how much to raise for now, then given a Rate of Return (RR), it’s possible to provide a very simple (yet useful) mapping between RR and FCR.

If RR < 0.8 then 95% fold, 0 % call, 5% raise (bluff)
If RR < 1.0 then 80%, fold 5% call, 15% raise (bluff)
If RR <1.3 the 0% fold, 60% call, 40% raise
Else (RR >= 1.3) 0% fold, 30% call, 70% raise
If fold and amount to call is zero, then call.

Don’t pay too much attention to the precise percentages listed above, the numbers will depend on the way you calculate your hand strength, and you’ll want to vary them depending on which betting round you are in. You will also want to vary these numbers to create players with different personalities.

Using this very simple mapping between the RR and the FCR decision can give you a surprisingly reasonable and entertaining player. They will tend to play strong hands, they will occasionally bluff, they won’t scare easy if their hand is good, and they will abandon weak hands when raised, and they will stick around on a reasonable chance of a flush or straight draw, making for entertaining gameplay.

The fact that none of the percentages is 100% is also important. That means you can never deduce the hand strength of your AI opponent based on their actions (unless they fold, where the information does not really help you). If they raise, then they could have any kind of hand strength – probably a good one, but it might be the 1 in 20 times when they are bluffing with a very weak hand.

STACK PROTECTION

The simple rules above work well when your stack of chips is large and the blinds are small. However as your stack shrinks and the blinds increase then the amount of money you need to commit to stay in a hand can become a very substantial proportion of your stack. Also, occasionally other players might go “all-in” , betting their entire stack of chips, so we need some logic to prevent the AI from making bad calls when short stacked.

Say you have AD, 2D and the flop is QC, KC, 2C. So you have a pair of twos, but there is a possible flush out there. There is $500 in the pot and the bet is $100 to stay in against two player, but it’s your last $100. The pot odds are 100/600 = 0.1666, your hand strength is 0.297, so your rate of return is about 1.8. So if you could play this situation over and over again you would make on average an 80% profit each time. However, it’s your last $100, and you have about a 70% chance of loosing everything. Don’t make that bet!

To handle this we can use a simple heuristic, along the lines of:

“If my proposed bet will substantially commit my stack, then don’t do it unless I have a strong chance of winning”

which might be implemented in part by:

“if (stack- bet) < (blind * 4) and (HS < 0.5) then fold”

Meaning if the call would leave you with less than four times the big blind, then don’t call unless you have a greater than 50% chance of winning.

Poker is a complex game, with a surprisingly large number of different types of situations like this that you have to handle somehow. I recommend you have as few special cases as possible, as it reduced the risk of an exploit being introduced into the game via some obscure special case. However, you should anticipate a number of heuristics (rules of thumb) being hard coded into the AI logic.

TESTING POKER AI

Playing a quick single table game of Texas Holdem takes around 30 minutes on average with human players. Ideally you would perform your testing by having human players play against the AI and trying to find problems with it. Unfortunately, due to the random hands being dealt, it’s very easy for one player to simply get lucky and win the game with sub-par logic, or even flawed logic. I’ve found it takes at least ten games to begin to get a clear picture of the qualities of an AI player, and more like a hundred games to be really sure. This often creates an unreasonably burden on the testing department, and introduces a very long delay in getting feedback on AI changes.

The solution is automated testing. The AI should be set up so that different variants of AI can play against each other in a very high speed set of games. You should also code a few simplistic poker AI’s into the mix, such as an AI that always goes all in, or another that simply always raises with a hand better than a pair of fives. Then you set your AI loose against these opponents, and make sure that it wins the appropriate percentage of games. If you coded your evaluation and simulation appropriately, then you should be able to simulate an entire game in about a second. (You might want to reduce the iterations of the simulation a bit to speed up testing).

The best use of your human testers is to try to get them to find an exploit of the AI, then you can codify this exploit into a temporary AI opponent to include in your test suite. You can then tweak your AI until it defeats the exploit, while still being able to defeat all the other (standard) opponents.

MORE WORK

What I’ve set out here is just a foundation for poker AI. By following the process laid out here you will get a reasonably strong and entertaining opponent. Here’s a quick list of the topics you might want to look into

”¢ Pre-flop hand strength tables
”¢ Opponent modeling.
”¢ Implied Odds.
”¢ Personality modeling
”¢ Positional play
”¢ Probabilistic search space
”¢ Game theory and Nash Equilibrium.

RESOURCES:

– Sklansky, David, The Theory of Poker, 1999, Two Plus Two Publishing. – Provides various discussion of pot odds, implied odds, etc, with many heuristics that might be useful.
– The University of Alberta Computer Poker Research Group:
http://www.cs.ualberta.ca/~games/poker/ A number of research papers on implementing poker AI.
– Hold’em Killer, Evin Peretz, http://www.holdemkiller.blogspot.com/ – A blog on implementing poker AI.
– Poker-Eval, http://freshmeat.net/projects/poker-eval/ – A GPL Licensed poker hand evaluation library.

Comments (21)

Debug and Release

Filed under: Game Development,Inner Product — Mick West @ 6:05 pm

This article originally appeared in the “Inner Product” column, Game Developer Magazine, October 2005

Introduction

Game projects usually have a number of “build configurations” , which control how the game code is compiled and linked into the game executable. Each build configuration, or build mode, builds the game with varying amounts of additional debugging code, and with the optimization options modified to aid debugging. Microsoft’s Visual Studio has by default just two: “Debug” and “Release” . The idea is that you will develop your game in Debug mode, making it easier to find bugs, and then you ship it in Release mode, with debugging code removed and optimizations switched on to make your code small and fast.

This division usually starts out fine. At the start of a game development cycle there are not too many assets in the game, level maps are small, not much of the logic is implemented and the CPU is rarely taxed. But after a few months of development, things start to get into the game. As the quantity of game assets, entities and logic approaches the level of the final game, debug mode becomes unusably slow. This leads the development team to switch to using release mode for day-to-day development work.

This situation can make bugs a lot harder to track down. The traditional release mode lacks debugging code and assertions. When the programmer attempts to reproduce the bugs in Debug mode, they first have to rebuild in Debug mode (which can take a very long time) then the code runs at a really slow frame rate, making it very difficult to reproduce the bug, and sometimes the bugs cannot be reproduced at all, due to the different configuration and initialization of memory.

I argue that the traditional division into Debug and Release is inappropriate for game development. Debug mode and release mode as traditionally envisaged should no longer be considered useful options, and should be replaced by a single build configuration “develop mode” that should be used at all stages of development.

Debug and Release mode

When you set up a project in Microsoft Visual Studio, you get two “build configurations” : Debug and Release. The distinction between these seems obvious: You use “Debug” to debug the project, and when it’s ready to ship you switch over to “Release”

This distinction is usually found in any game development project, and sometimes it’s extended to some additional configurations, like “ExtraDebug” , “Final” or “Submit” .

The intent of debug mode is to make the code easier to debug. Now we’ve become so used to having the distinction between debug and release over the years that it’s useful to take a step back and see exactly why we needed a debug mode in the first place.

A common misconception that I’d like to clear up right away is that you can’t use the debugger unless you build your code in debug mode. This is simply not true. The debugger will work just fine in release mode, but some aspects will be a bit harder to debug.

The idea behind debugging mode is to make it as easy as possible to track down bugs in your code. This results in two major differences:
1) The code is compiled in such a way that the debugger works as well as possible.
2) Extra code is compiled in, usually in the form of assertions, to trap bugs as early as possible, and to provide additional debugging information.

Optimization: In debug mode, optimization is disabled, so the code runs a lot slower. Why is this? Well, optimization involves re-ordering the sequences of assembly instructions so there is not always a direct correlation between a group of assembly instructions and a line of C++ code. So when you step through your code in the debugger, it’s not always clear exactly where you are in the code, and the PC might jump oddly between lines.

Also, optimizing involves keeping values in registers instead of memory as frequently as possible. So, local variables that are not used very much are not stored in the local stack frame. This means that in the debugger you often can’t examine the contents of a local variable in optimized code, as one you step past the code that uses it, the value no longer exists. In debug mode the local variables always have local storage for each local variable, so you can always examine their values in the debugger.

Inline expansion: Another form of optimization is the expansion of inline functions. This is done to clarify the flow of control in the debugger. If an inline function is expanded inline then you can’t step “into” it, and you can’t step “over” it. It almost becomes invisible to the debugger.

Debug Code: Besides the optimizations, you usually add additional “debugging code” to your program to help track down bugs. To this end you generally have some symbols defined that tell you at compile time which build configuration you are using, so you can conditionally compile code in for debugging.

In Microsoft visual studio these symbols are “DEBUG” which if defined when you are in debug mode, and “NDEBUG” which is defined when you are not in debug mode (or in release mode)

Assertions: Everyone should be familiar with assertions. Assertions are additional line of code that are sprinkled through the program to ensure that everything is as it should be. Basically it’s a macro that checks (asserts) that a condition that you know should be true actually is true. If the assert fails, then the program execution halts, and you can see what went wrong. Asserts should be your most useful tool in debugging your code.

Let’s have a quick look at Microsoft’s implementation of the assert macro:

#ifdef NDEBUG
#define assert(exp) ((void)0)
#else
#define assert(exp) (void)( (exp) || (_assert(#exp, __FILE__,__LINE__), 0) )
#endif /* NDEBUG */

As you can see, the assert is compiled away to nothing when you build in release mode. This makes sense, as when you release the game, you don’t want asserts going off. Plus they take up space, and all those extra checks slow down the game.

Debug and Browse information: For the debugger to work it needs to know how to link the code it is executing to the original source. This information is generated during compilation and linking and is referred to as “debug info” . In many “release” configurations the debug information is switched off. This is generally to make the project build faster, and to make the executable smaller. An executable .ELF file built with GNU C++ can be anything use to 100MB in size when built with full debug info, compared with a 3-5MB file without debug info.

Memory allocation and initialization: Memory allocation is the source of many bugs. So it makes sense to have extra debugging code in your memory allocator. Typically memory will be initialized to a known value when it is allocated. The pattern of allocations will also differ, as blocks have extra debug info added. Since most games use their own custom allocators, the differences vary. But nearly all games will have some difference in memory allocation between debug and release modes.

Uninitialized variables are a common source of error. Generally you should be catching such problems at compile time. But if not, then you need to realize that their behavior will differ in release and debug modes. In debug modes, local variables are more likely to have actual storage, and depending on your precise build configuration and compiler, they will probably be initialized to zero in debug mode. In release mode the uninitialized variable will be some random value – whatever happened to be in memory or the register used.

Game are real time applications. They have to run at a reasonable speed in order for you to say that they are working. So if the debug mode is very slow, then it’s not going to be practical to use the debug mode as the development build – the build used by artists and level designers implementing assets, and by programmers implementing new features in code or script.

However games are also incredibly complex applications, where the game engine is often still in development while the game content is being created and implemented using that game engine. If everyone is using the release mode for development then it makes it very difficult to track down the bugs that inevitably arise in a development environment.

Clearly then neither of the traditional build modes is suitable for game development. I’d like to make the case that a single hybrid mode should be used for 99% of all development. This hybrid mode (which I’ll call “Develop Mode” ) should be fast enough so that gameplay is not affected, and yet contain enough debugging features so that bugs are caught early, and easily tracked down. I’ll also make the case that develop mode should essentially be the configuration mode your game ships with.

Develop Mode

Assertions switched on. Developing without assertions is like driving with your eyes closed. You’ll know when you crashed, but you won’t know why you crashed, and your crashes will be much worse. Having assertions on all the time during development will greatly improve the rate at which you find and fix bugs.

Optimization switched on. Your code needs to be fast if develop mode is going to be used by artists and level designers. Sure, you can’t tell exactly what is going on in the debugger, you won’t be able the see the contents of local variables. But you will be still be able to identify the place where in the code the crash occurs, see the call stack and roughly follow the logic flow. If you need more information then often the solution is to add more assertions. You can add logging calls to track the contents of variables, and if all else fails you can temporarily switch on optimization.

Inline Expansion switched on. Similar to optimization, but with games the inline expansion being off is often a far greater source of slow debug code than other aspects of optimization. Most games will have some kind of custom 3D vector class, usually with accessor functions, or overloaded [] operators that use inline functions. Having these functions be explicit adds a vast overhead to code execution. The benefit you get is simpler flow of execution in the debugger, something you rarely need. So, switch inlining back on.

Link without debug information – This one can be a vast time saver. The link stage of the edit-compile-link-run cycle can take over a minute, or even several minutes, depending on the type of project. The vast majority of that time is spent in generating debug information, when really all you want is the executable. Remember the compilation units are still being complied with debug information, so if the code crashes, and you want to go into the debugger, then you can just switch debug information back on and re-link, and you’ll have the debug information, but only when you actually need it.

Make your assertions fast. Assertions should never need more than 5% of your total CPU time. If you turn assertions on and your framerate plummets, then there may be a problem in the implementation of your assertion macro (many game developers implement their own version of assert). The majority of assertions are simply comparisons of two values, and usually one of these values is something the compiler will have in a register, and the other if frequently a constant. So your assertion should compile to two or three instructions that perform the test and then skip over the code that arranges the parameters and calls the assert handler. You should verify this by looking at the compiled code in the debugger.

Use assertions appropriately. Much as I love assertions, there is such a thing as too many assertions. Assertions in very low level functions are often testing the same thing over and over again. For example, collision detection code might use unit normals stored in the mesh. Since collision code runs a lot each frame, then adding an assertion to verify that the normals were of unit length might have a serious impact on framerate. Plus you’d be repeatedly checking the same normals over and over. In this case it might be better to verify the input just once when the mesh is initially loaded, or even when it’s originally generated.

In addition, putting assertions in at such a low level of the code is often hit and miss. The conditions that might cause the code to fail could occur very infrequently. You might have to play the game for hours in order to hit the triangle with a malformed normal. Here you want to use automated tests that ensure full coverage of the code and data. These tests can be run constantly, and need not be part of the develop build.

Ideally your develop mode should also be your release mode. This automatically eliminates that bane of game development: the bugs that only show up when the game is in release mode. The problem here is that you’ve got to devote up to 5% of your CPU time, and a chunk of memory, if you want to leave in all your assertions. The solution is obviously to budget for that from the start.

This may seem like a lot of ask. The harsh reality is that game developers are often scrambling for a few thousand extra bytes, and CPU time is never adequate. But consider the benefits of having just one build configuration up to and including the version you ship. There is no risk of obscure bugs popping up due to changes in the code. If you ship for console you can get back more useful information from publisher tests. If you ship on PC, you can add a facility for users to report assertions, which will allow you to get that patch out quicker.

If you ARE going to ship without assertions, for whatever reason, then make sure you budget enough time to test that version. The majority of your testing should be done on a version with assertions in, as you’ll track down ordinary bugs much quicker. But at some point you’ll need to test your final version. Just don’t switch off the assertions the day before you submit. It’s actually might be a good idea to occasionally test with assertions removed, as the different code configuration might bring more obscure bugs to the surface.

Reference and further reading

Rabin, S. Squeezing more out of Assert. Game Programming Gems 1.
Etherton, D. Designing and Maintaining Large Cross-Platform Libraries.. Game Programming Gems 4.
Hunt, A & Thomas, D. Leave Assertions turned on. The Pragmatic Programmer. p123.

Comments (0)

January 3, 2007

What is Cowboy Programming?

Filed under: Cowboy Programming,Game Development — Mick West @ 11:36 pm

Cowboy Programming is a description that is often applied to game programmers. It implies an individualistic style of programming, where the programmer creates ad-hoc solutions to problems based on their personal experience and knowledge base, rather than implementing a standard solution, or one arrived at by teamwork.

A good cowboy programmer is able to quickly investigate a problem and create an appropriate solution without going through the steps of a formal methodology. A bad cowboy programmer locks himself away for days and produces a mess of buggy and unreadable code.

I am a Cowboy Programmer, hopefully a good one. I like to solve useful problems. I like to understand why things work and why they fail. I like code that is clear and solid. I enjoy working as part of a team, but I also like grappling alone with a problem for as long as it takes.

Is Cowboy Programming a bad thing?

That’s a bad question. It’s like asking “Are laptops bad?”. Sure, laptops are slow, they have small screens, small hard drives and tinny speakers, they are easily stolen and hard to upgrade. But are they “bad”? No, laptops are great at what they do.

Or rather: good laptops are good at things that laptops do. A crappy laptop is crappy.

A good cowboy programmer is good at the things that cowboy programmers do. They quickly find appropriate solutions to problems based on their individual knowledge and experience.

The games industry is not a place where you specify a piece of software and then you write it. This is not a failing of the industry, it is a necessity of the environment the industry operates in. The potential problem space is too complex to submit to a formal methodology. To work in that environment you need to be adaptable, and to be able to work independently. You need the skills to solve new problems quickly. You need to be a good cowboy programmer.

Comments (0)

January 2, 2007

Pushing Buttons

Filed under: Game Development — Mick West @ 5:31 pm

This article originally appeared in Game Developer Magazine, May 2005.

Button Disambiguation: How to make your game feel more responsive and intuitive by measuring and intelligently resolving ambiguous player control

Introduction

Download the sample application:
PushingButtons.zip
158K

Have you ever been playing a game, and you pressed a button to make your character do something, and the character either did something else, or simply did nothing at all?

More to the point, if you are responsible for the design or implementation of the player control in a game, have you ever had someone come to you and tell you it’s broken, or worse: “it doesn’t feel right“ yet when you try it yourself, it seems fine to you. Is it in their imagination?

A core aspect of programming a game is converting the player’s input from the gamepad buttons into the character’s actions. On the face of it this appears a simple problem: you just map buttons to events. However due to the non-precise way that different players press buttons and perceive events, problems of ambiguity arise which lead to frustration and a feeling of unresponsiveness. The player thinks he has hit the correct button at the correct time, but, as he’s not a robot, the intent of his input is ambiguous and cannot be resolved satisfactorily with a simple mapping.

This article discusses the physical and perceptual reasons for this ambiguity, discusses how different people produce different input, describes methods for analyzing what is actually occurring, and presents a strategy for resolving input ambiguity that allows your game to feel responsive and intuitive.

Figure 1 – The layout of the Xbox controller. This square arrangement of buttons is the most common layout on current consoles.

On a typical gamepad such as the standard pads used with the Sony Playstation and the Microsoft Xbox, there are four buttons that are all operated by the player’s right thumb. In this article I’m going to concentrate mainly on the use of these four buttons.

One of these buttons is the primary button, which is used to trigger the primary action in the game. In platform games such as Mario, this is Jump. This button is X on Sony Dualshock, where it is the bottom button of the group of four. The primary button is A on Xbox, in a similar configuration to the Sony Dualshock.

For the purposes of the article I’m going to be referring to the buttons as laid out on the X-Box. Mostly because I can refer to each button by a letter (A,B,X or Y) and the diamond shaped layout is the most common. (see Figure 1)

Mapping player input to game events

The mapping of player input to events can be done in a number of ways, but basically, it boils down to a set of rules. Each game event (such as jumping) must satisfy a number of conditions before it can be triggered. This can be expressed in pseudocode as:

If (conditions met) then (trigger event)

I am going to use two examples here, both based on a simple Mario style platform game.

Example #1: Jumping

Let’s say our primary button (A) is the jump button. The rule could be:

If Button A pressed then JUMP

But you don’t want to jump while you are in the air, so a more reasonable rule would be:

If Button A pressed and ON GROUND then JUMP

Example #2: Super Jump and Ground Pound

This is two moves performed with the same buttons. In our game, pressing the right shoulder button (R1) causes the character to crouch. Pressing A while crouched will cause the character to do a super high jump.

If you press R1 while in the air, the character will do a ground pound where they rapidly slam down into the ground. Both these moves are similar to moves in Super Mario 64.

This can be expressed as three rules:

If button R1 pressed and ON GROUND then CROUCH
If button R1 pressed and IN AIR then GROUND POUND
If button A pressed and CROUCHED then SUPER JUMP

This seems nice and straightforward. However there are numerous problems with this simple implementation. As we will see later, getting player control to work is inevitably a fiddly and complex task. In order to provide a feeling of simple intuitive control to the player you will actually have to write some rather convoluted ad hoc code.

Detecting and Recording player input

The simplest way of handling input from a joystick is to query its current state. i.e. is button X pressed right now.

This works fine for things such as acceleration and braking, which are continuous events. But for one-time events, such as jumping and shooting, you actually want to detect when the player initially presses the button. You need to respond when a button goes from “released” to “pressed” – i.e., when the button is “triggered” .

In addition, you often want to know how long a button has been pressed, and sometimes how long since a “trigger” event has occurred. You might also want to be able to flag the “trigger” event as having been handled, so it does not get continually re-used.

So, for the purposes of this article, I’m going to assume a fairly simple implementation of a joypad button interface that provides the information listed above. This is also what I’ve implemented in the sample code.

Thumb positioning – Precise Thumb vs. Sloppy Thumb

On the vast majority of games, the player rarely moves his thumb from the vicinity of these four buttons, so even when not pressing a button, they will rest their thumb over the buttons, ready to press them as needed.

There are two basic ways in which a person can hold their thumb over the buttons. I call these Precise Thumb and Sloppy Thumb. As we shall see, it is important to consider both types of thumb positioning when designing and implementing your control scheme.

Figure 2a – The “precise” player rest the tip of their thumb over the primary button.

Precise Thumb is where the player uses the tip of his thumb to press each of the four buttons. When moving from one button to another, the precise thumb player will lift his thumb off the button, and press it down on another button. (Figure 2a)

Figure 2b – the “sloppy” player rests his thumb over all the buttons – allowing a button press by just tilting the thumb..

Sloppy Thumb is where the player uses the whole thumb to operate the four buttons. There are many variants, but basically the sloppy player uses the thumb tip only to operate the uppermost button. The buttons to the sides are operated by the sides of the thumb, and the lower button is operated by either the middle of the thumb pad, or the bone at the first joint in the thumb. (Figure 2b)

In order to press a different button, the sloppy player will usually just tilt his thumb.

The Nintendo Gamecube controller is built around the expectation that the player will use some kind of “sloppy thumb” technique. There is a large central primary button, and with the three other buttons surrounding it, encouraging you to keep your thumb squarely over the primary button, and hit the other buttons with the edges and the tip of your thumb.

This radical difference in thumb position (and button layout) can lead to perceived control problems that only arise on one platform and with just one tester. But remember one tester out of your pool of testers could equate to many thousands of player having the same problem if you are making a reasonably popular game. If you have very few testers, then if someone comes to you and says “it feels wrong when I do this” , then pay close attention, because they represent a goodly chunk of your potential audience.

The Causes of Ambiguity

Ambiguity cause #1 – Imperceptible State Changes.

The player runs his character towards the chasm. When he gets to the edge, he presses the jump button. The character does not jump, and instead plunges to his doom.

Why did the character not jump? Because in the internal model of reality that the game stores, the character had already run off the edge of the cliff when they pressed the jump button. However, to the player it looked like he was right at the edge and a jump looked perfectly possible. There was a discrepancy between the perceptions of the time at which the player left the ground.

On one frame (on the ground) pressing button A will make the character jump. On the next frame (in the air) pressing A will have no effect. To the player these two frames of the game (just 0.016 seconds apart) appear identical.

This problem also occurs when jumping just before landing, or just before hitting a wall (for a wall jump)

Ambiguity Cause #2 – Change in button function based on state.

In Nintendo’s Super Mario 64 DS, when playing as Mario, pressing A to jump then R1 (R1 = Right shoulder button, use the Z button on the Nintendo 64 version), will trigger a ground pound. Pressing R1 then A will trigger a backflip. Pressing them both at the same time will cause one of: a ground pound, a backflip, or a normal jump, seemingly at random. This is bad because the user has no control; they are doing the same thing over and over, yet getting different results.

This problem also shows up in Mario when you try to do a long jump, which is done by running, then jumping by pressing A+R1. Sometimes while attempting this you will do a Ground Pound by accident. This is not the fault of the player. To the player it appeared they did everything right, but the results were not what they expected.

Now at this point you might think “hey, I’m pretty good at Mario, I don’t notice those problems, I can long jump no problem” . Sure, but to get good at it you had to train yourself to work around these control issues. You had to learn that to do a consistent long jump you had to tap R1 and between 0.05 and 0.18 seconds later tap A. This you learn by failing repeatedly when you first play the game until you get it right.

To the beginning player this is not fun. Controls that do not do what you want are frustrating. Just as bad are inconsistent controls, where sometimes the character does one thing, and sometime another, without any logic perceptible to the player. They might not be able to voice it exactly, but the game “just feels wrong” . Ignore this at your peril.

Ambiguity Cause #3 – Accidental button presses with Sloppy Thumb

Figure 3 – The “precise” player moves his thumb from A to Y by lifting his thumb off A and putting it on Y. This is a nice clean motion, and it’s easy to assume your player will do this.

When moving his thumb from one button to another when using the Sloppy Thumb technique, it’s quite likely that you may hit some other button by mistake. Particularly when going between opposite buttons

For example, when going from A to Y the “precise thumb” player will lift their thumb off A and then press Y (Figure 3) in a nice clean motion. The sloppy player will tilt their thumb forward, rocking it from A to Y and quite possibly momentarily pressing X or B. (Figure 4)

Figure 4 – The “sloppy” player moving from A to Y. The player is simply tilting his thumb forward, and in the second image accidentally presses X with the side of his thumb. At that point all three buttons are depressed. Your game will feel better if you can handle this case.

Ambiguity Cause #4 – Release and Press vs. Press and Release

It’s common in games for an action to involve releasing one button, and then pressing another in rapid succession.

This can happen in two ways. Say when moving from A to X. The precise thumb player Releases X, then Presses A (figure 5) whereas the sloppy player will Press A, then release X, so for a brief period of time, A and X are both pressed.(figure 6)

Figure 5 – Precise movement from A to X.

Figure 6 – Sloppy movement from A to X. Note how at one point both A and X are pressed together.

This can lead to problems if you have an event triggered by A+X being held. Or if your event tied to X relies on no other buttons being pressed.

Measuring Ambiguity

In dealing with all these problems, the most important tool is one that gives you the ability to see exactly what is happening, and more importantly “what just happened” . People are going to show you control problems, and you are going to have to figure out the precise sequence of events that caused that problem to occur.

The first thing to implement is a simple log file. Every time an event of interest occurs, you just print out that event, along with the time and any other useful information. Then when the control problem crops up, you just look at the log file and see what caused the problem.

In the example game I’m using the “OutputDebugString” Win32 function, since there are numerous programs designed to capture and filter these debug strings. I use a free one called “DebugView” by Sysinternals (www.sysinternals.com)

Example: consider the situation in Figure 4, the corresponding output log looks like this:

14.218: + Pressed A
14.218: Event Jump
14.418: + Pressed X
14.418: + Pressed Y
14.484: – Released X
14.484: – Released A

The number on the left is the time in seconds. The player presses A (fig 4a) to jump (which happens in the same frame). Then the player rolls his thumb forward (figure 4b) pressing Y and X simultaneously. As the thumb rolls forward fully onto Y, they release X and A simultaneously.

Now consider the case for one the ambiguity problems we listed above, the “Super Jump” vs. ” Ground Pound” . Remember the problem was that pressing A and R1 at the same time resulted in inconsistent results. Sometimes a Super Jump, and Sometimes a Ground pound.

Here’s the output for a ground pound:

21.199: + Pressed A
21.199: Event Jump
21.215: + Pressed R1
21.215: Event Gnd_Pnd
21.232: Event Land
21.249: Event Crouch
21.432: – Released R1
21.432: Event UnCrouch
21.465: – Released A

Here’s the output for a super jump

33.778: + Pressed R1
33.778: Event Crouch
33.794: + Pressed A
33.794: Event SuperJump
33.961: – Released A
33.977: – Released R1

Note that in the first listing A is pressed before R1 (by 0.016 seconds), so the game triggers a jump first, and then a ground pound.

In the second listing R1 is pressed 0.016 seconds before A is pressed, meaning the game triggers the crouch, then as we are crouched, it triggers a Super Jump when A is pressed.

To the player, it looks and feels like they attempted to do the same thing both times. They simply pressed A and R1 simultaneously. There is no way whatsoever that they can distinguish between the two events described above. The response of the game is inconsistent and ambiguous. There is no clear link formed in the player’s mind between action and response. They get annoyed. Your job is to stop this happening.

Log files are useful but can be rather dense, and difficult to pore over. The lines all look alike, and the problem can be hidden in a dense flurry of events. The next step is to present this information in a graphical format.

Figure 7 – The test application; a simple platform game with the input debug graph shown on the top.

This is what I’ve done in the example platform game that accompanies this article. All I did was add a simple “watcher” class which would record the value of a variable (such as a button up/down state, or a physics state or flag) every frame and then display this as a scrolling state graph across the top of the screen, with a separate line for each variable that was being watched. (Figure 7)

To this state graph I added an event recorder which recorded events (jump, land, fall, super jump, late jump and crouch) and displayed them on the graph as a vertical line, labeled with the event.

Finally I made the graph able to be scrolled and zoomed in and out by the joypad when the game is paused.

So, whenever some control problem occurs it’s very easy to pause the game, and then scroll over and zoom into the area on the graph that caused the problems.

Figure 8 – Timing graph for tapping A and X with Precise Thumb (top) and Sloppy Thumb (bottom).

Take a look at a simple example, sans events. Figure 8a shows the state graphs for the A and X buttons that correspond to a precise player tapping from A to X. When the line is up, that means the button is pressed. The lines along the bottom represent 1/10th of a second intervals. So we can see that we have a series of button presses for each button, each lasting about 1/10th of a second, separated by times where no button is pressed for about 1/10th of a second. This is typical of a precise thumb player.

Now in figure 8b we see the exact same situation but with a “sloppy” player. Here the presses last much longer 2/10ths or 3/10ths of a second, and the button presses overlap, with A and X being pressed at the same time when moving between buttons. Note thought that the total time between the first press of button A, and the last press is about the same in both examples.

Figure 9 – Showing a jump attempt that failed by just 0.016 seconds.

Let’s look at the late jump problem. Player runs to the edge of the cliff and jumps. They don’t jump, and instead fall to their doom. The graph for this is in Figure 9. I’ve added a Watcher on the “Air” state so you can see when it transitions between ground and air. We can actually see that the player hit the jump button just 16.7ms after leaving the ground. Not only is this amount of time too small to recognize, but the frame display the player off the edge of the cliff WILL NOT EVEN HAVE BEEN RENDERED YET! The player is not going to understand that your internal representation of his position indicates he can’t jump. He’s just going to be annoyed that it looked like he should jump and he did not.

How to fix this problem? Well, the slightly non-intuitive thing to do is to allow the player to jump for a short period after they have left the ground. Typically this period will be 0.2 seconds or less. In my sample game it’s 0.1 seconds, but in a 3D game you’ll probably use a longer time as the visual representation is less precise.

The new rule for this is:

If button A pressed and IN AIR and ON_GROUND 0.1 seconds ago then JUMP

This allowing a late jump removes the uncertainty – if the player jumps at the edge, then they will always jump. They do not have to be accurate. In fact they would have to be very inaccurate to miss the jump.

Figure 10 – The late jump in action. The player is allowed to jump even though he’s been falling for a couple of frames.
.Compare with the end of figure 9.

Figure 10 illustrates this solution in action. You see A is pressed two frames after the player starts to fall. But we still allow a jump.

Will this late jump look odd? Generally it will look just fine. Sometimes it looks a little unusual if the player jumps at the very extremity of the late jump window. Also you have to make sure your animation and sound don’t glitch as you momentarily go from ground to fall to jump in the course of 0.1 seconds. But the player will tolerate a little oddness now and then far more than they will tolerate unresponsive controls.

Figure 11 – The player tries to super jump by couching then jumping, but accidentally jumps one frame before crouching causing an instant ground pound.

Now our second problem – the Super Jump vs. Ground Pound for which we printed out the event logs earlier. Figure 11 illustrates this situation. We see that A is pressed causing us to jump. Then on the next frame, R1 is pressed in the air so we immediately get a ground pound and then land again, all in the course of 0.033 seconds.

Then the other case is show in Figure 12, where the Player hits R1 0.016 seconds before A, so we get a crouch, and then we trigger a Superjump. Again the difference between the sequence of events in Figures 11 and 12 is just 0.016 seconds, 1/60th of a second. Not enough time to blink!

Figure 12 – With just a fraction of a second difference in input, the player successfully does his Super Jump.

How to fix this problem? First of all, let’s decide what pressing A+R1 should actually do. Since we are on the ground, the player is probably trying to do a super jump. So all you have to do is add another rule similar to the late jump:

If A+R1 pressed and IN AIR and ON_GROUND 0.1 seconds ago then SUPERJUMP

Strategies for Resolving Ambiguity

I’ve presented some specific solutions here for two examples of ambiguity by adding additional “disambiguation rules” to your player control logic. I believe that to achieve truly intuitive and responsive player control, this type of disambiguation rule is going to be your primary tool.

Disambiguation rules are very application specific; they also can end up being rather complicated. It is important not to shy away from adding complex layers of rules simply because it makes your code (or data) look messy. If you detect a control ambiguity, and you can add a rule, then go ahead and add it.

The logic you are implementing is NOT elegant. You are creating a interface between a highly complex multi-state and multi-variable simulation, and a very ill-defined set of potential physical players. It is inevitable that there will be a high degree of ambiguity which can only be resolved by a similar degree of complexity, and that your code and data will need constant adjustments.

To fix a control ambiguity problem, first find out EXACTLY what is happening by means of tools such as the ones I have described herein. Then ask: “for this state and this input, what should happen” . The add disambiguation rules to make that happen. Test. Repeat.

Final Thoughts

Many controllers have analog buttons, where you can tell how hard the player is pressing a button. You can get more information about the intentions of the player by taking this into account (for example, an accidental press will often have a lower pressure than an intentional press). This does however greatly raise the level of complexity needed.

As well as buttons; you have the analog sticks. With its far greater number of states it can be difficult to disambiguate. The state graphs I’ve show above can easily be extended to include analog values.

An ideal framework for player control development would allow you to record the game state and inputs for the duration of the problem, and then replay it, editing the rules on the fly until the problem is resolved.

Game designers, programmers and testers need to be aware of the problems of input ambiguity. By educating the testers, you can have them analyze their own input when they encounter a difficulty. Testers are often highly skilled players, used to accommodating frustrating controls. Teach them not to accept the slightest annoyance, and instead figure out exactly what they are doing to trigger it. Your game should be accessible to as many people as possible.

Cowboy Programming Game Development and General Hacking by the Old West