technology:opinions:abiasedguidetounittests
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
technology:opinions:abiasedguidetounittests [2021/09/05 16:57] – [The know-it-all function fallacy] Owen Mellema | technology:opinions:abiasedguidetounittests [2021/09/06 22:52] (current) – Owen Mellema | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== A Biased Guide to Unit Tests ====== | ====== A Biased Guide to Unit Tests ====== | ||
+ | //This article was written to explain my thoughts on how Unit Testing should be done. Your mileage may vary.// | ||
===== Introduction ===== | ===== Introduction ===== | ||
Testing is important. It's really fucking important. Yes, it can be annoying, but so is checking that you have your parachute on before jumping out of a plane. Despite this, the world of thought surrounding unit testing still seems to be in its infancy compared to thought about general programming. Many very competent developers will rush through the process of unit testing, making significant errors in the process, and many other competent developers will look at the unit tests and not see anything wrong with them. | Testing is important. It's really fucking important. Yes, it can be annoying, but so is checking that you have your parachute on before jumping out of a plane. Despite this, the world of thought surrounding unit testing still seems to be in its infancy compared to thought about general programming. Many very competent developers will rush through the process of unit testing, making significant errors in the process, and many other competent developers will look at the unit tests and not see anything wrong with them. | ||
Line 30: | Line 31: | ||
< | < | ||
- | class FibonacciTest | + | class TicketTest |
public static int PERSON_ID = 5; | public static int PERSON_ID = 5; | ||
@Nested | @Nested | ||
Line 79: | Line 80: | ||
In this article, I will propose my definition of what makes a good unit test. | In this article, I will propose my definition of what makes a good unit test. | ||
- | ===== The Topology of Testing | + | ===== Value Hierarchies ===== |
+ | |||
+ | When we write functional code, there is a hierarchy of value that defines quality. | ||
+ | - Correct: It should do what we expect. | ||
+ | - Efficient: It should not waste time or space. | ||
+ | - Resilient: Changes in how this code is used, or how other code is defined, shouldn' | ||
+ | - Extensible: We should be able to make changes to the code without a massive amount of work. | ||
+ | - Readable: Humans should be able to understand what is happening. | ||
+ | |||
+ | When we write testing code, however, we have different requirements. | ||
+ | - Honest: The code should test the SUT. | ||
+ | - Comprehensive: | ||
+ | - Explicit: The code should " | ||
+ | |||
+ | This difference in value means that applying best practices from one paradigm to the other causes poor outcomes. For example, when writing functional code, it is considered an antipattern to copy and paste code. Instead, a good program has similar tasks being handled in the same way. Developers will sometimes fall into the trap of thinking that this is for their benefit, because it is more convenient. While this is true, the more important reason that copying and pasting is considered an antipattern is because it is likely to introduce bugs, therefore causing problems with our most important functional requirement: | ||
+ | |||
+ | However, I would argue that, for testing, copying and pasting is not an antipattern. This is because correctness is not a concern when writing tests. While reducing repeated code can be good for comprehension in some circumstances, | ||
+ | ===== Writing Honest Tests ===== | ||
+ | ==== The Topology of Testing ==== | ||
Tests are, ideally, all about data. We have some SUT, an input, and a desired output (or outcome). We feed the input into the SUT and check if the result matches the desired output. Below, I have mapped out what this looks like in a pseudo-flow chart. | Tests are, ideally, all about data. We have some SUT, an input, and a desired output (or outcome). We feed the input into the SUT and check if the result matches the desired output. Below, I have mapped out what this looks like in a pseudo-flow chart. | ||
{{ : | {{ : | ||
Line 290: | Line 309: | ||
Let's say we have a function A, and another function that computes the correct output of A, called B. If B is //less correct// than A, why are we using it? If it is //more correct// than A, why don't we replace A with B? If it is //equally correct// as A, what's the point in using it in assertions? In reality, there' | Let's say we have a function A, and another function that computes the correct output of A, called B. If B is //less correct// than A, why are we using it? If it is //more correct// than A, why don't we replace A with B? If it is //equally correct// as A, what's the point in using it in assertions? In reality, there' | ||
- | This leads me to my next rule of testing: **The only logical component in a test (excluding the packager and extractor) should be the SUT.** | + | This leads me to my next rules of testing: **The only logical component in a test (excluding the packager and extractor) should be the SUT.** |
+ | |||
+ | ==== Skipping the Extractor ==== | ||
+ | Another idea that we might have is to skip the extractor by converting the expected output into an object that can be directly compared with the output of the SUT. That would look something like this: | ||
+ | |||
+ | < | ||
+ | //Define input | ||
+ | int input = 0; | ||
+ | |||
+ | //Package input | ||
+ | FibonacciCriteria fibonacciCriteria = mock(FibonacciCriteria.class); | ||
+ | when(fibonacciCriteria.getNumber()).thenReturn(input); | ||
+ | |||
+ | //Feed packaged input into SUT | ||
+ | FibonacciResponse packagedOutput = Fibonacci.fib(fibonacciCriteria); | ||
+ | |||
+ | //Create a packaged expected output. | ||
+ | FibonacciResponse packagedExpectedOutput = FibonacciResponse | ||
+ | .builder() | ||
+ | .result(0) | ||
+ | .build(); | ||
+ | |||
+ | //Assert that the actualOutput matches what we expect | ||
+ | assertEquals(packagedExpectedOutput, | ||
+ | </ | ||
+ | |||
+ | And we would represent it like this: | ||
+ | |||
+ | {{ : | ||
+ | |||
+ | People generally consider this when they have outputs with a lot of values in them that need to be tested. This isn't as big of a deal as the other mistakes I have outlined above. It doesn' | ||
+ | |||
+ | The biggest problem is that we are implicitly trusting that the .equals() method will give us a meaningful result for our object. You probably know that in Java, when the .equals() method is not defined, calling .equals() will compare the object' | ||
+ | |||
+ | That's not a big deal with a simple object comprised of a few primitive fields. Just add a .equals() method (Using lombok makes this dead easy). But what if the object is comprised of other objects, themselves without a .equals() method? Again, no problem, just add a .equals() to those objects. But what if those objects are in different repositories that you own? Now it's becoming a pain, but still manageable. Just open a PR for those repositories, | ||
+ | |||
+ | And all that's assuming that you know where the problem is! If you have a data structure that relies on hash codes to function, like a Map or Set, and somewhere along the line some jackass didn't implement .hashcode() correctly, it's anyone' | ||
+ | |||
+ | Now, you might object that if .equals() isn't implemented correctly, that could be a problem with the SUT. I agree. But it also might not be a problem with the SUT. Maybe someone has implemented a superclass differently than expected. Maybe we expect .equals() to work in a different way. But all that's besides the point, because .equals() should be tested on its own if it's so important! ((There are libraries that can do this for you, if it's a concern. Despite this, outside of when I was in training, I have never seen anyone write unit tests of a .equals() method)) | ||
+ | |||
+ | Also, testing this way obscures specific problems with the code when they arise. If only one field is off, then the entire object is considered to be incorrect. There' | ||
+ | |||
+ | Along those same lines, when something changes with the expected outputs, it's a slippery slope for an engineer to revert back to using the " | ||
+ | |||
+ | Finally, I would argue that this approach doesn' | ||
+ | |||
+ | So, save yourself the headache, and don't skip the extractor. The rule here is: **One side of the assertion should be connected directly to the expected output**. | ||
+ | |||
+ | ==== Summary ==== | ||
+ | The rules we have discussed in this section are: | ||
+ | - One side of the assertion should be connected to the SUT. | ||
+ | - The other side of the assertion should be connected directly to the output. | ||
+ | - The only logic in the test should be in the SUT. | ||
+ | |||
+ | ===== Writing Comprehensive Tests ===== | ||
+ | I'm not going to write about this, as our understanding of testing edge cases is pretty good. | ||
+ | ===== Writing Explicit Tests ===== | ||
+ | When we write tests, we are writing the most complete and accurate documentation of the functionality of our SUT. If I write a javadoc comment explaining what the SUT does, the reader must take my word for it that this is what it does. For all the reader knows, I might be lying (or just incorrect). However, when we write tests, our documentation is self-proving. The reader can run the tests and be assured that this is how the SUT will act. | ||
+ | |||
+ | Despite this, since we write tests using code and code is not necessarily easy to read, we must make a special effort to ensure that our testing code is readable. Luckily, if you are following my advice from above, the logical complexity of the testing code should be low. | ||
+ | |||
+ | ==== Using Given-When-Then Correctly ==== | ||
+ | The Given-When-Then paradigm is, in most cases, the best way to organize tests. My only ask is that when you write GWT tests, you do it //the right way//. Otherwise, you'll just be cluttering up your codebase with unnecessary nesting. | ||
+ | |||
+ | The biggest strength of GWT is that every part of the testing topology is represented by a concept in GWT: | ||
+ | * **Given** corresponds to defining and packaging input. | ||
+ | * **When** corresponds to calling the SUT and extracting. | ||
+ | * **Then** corresponds to extracting((This can also be done in the When step, if it is more convenient)) and asserting an output. | ||
+ | |||
+ | {{ : | ||
+ | |||
+ | A good strategy for accomplishing this is to define your input and output variables in your root class, and then use a @BeforeEach to set a variable to what you want it to be for the test. Here's what that would look like for testing the 0 case for Fibonacci. | ||
+ | |||
+ | < | ||
+ | @Nested | ||
+ | class FibonacciTestGWT { | ||
+ | FibonacciCriteria criteria; | ||
+ | FibonacciResponse response; | ||
+ | |||
+ | @Nested | ||
+ | class GivenAnIOf0 { | ||
+ | @BeforeEach | ||
+ | void init() { | ||
+ | criteria = mock(FibonacciCriteria.class); | ||
+ | when(criteria.getNumber()).thenReturn(0); | ||
+ | } | ||
+ | |||
+ | @Nested | ||
+ | class WhenCallingFibonacci { | ||
+ | @BeforeEach | ||
+ | void init() { | ||
+ | response = Fibonacci.fib(criteria); | ||
+ | } | ||
+ | |||
+ | @Test | ||
+ | void thenResultIs0() { | ||
+ | int result = response.getResult(); | ||
+ | assertEquals(0, | ||
+ | } | ||
+ | } | ||
+ | } | ||
+ | } | ||
+ | </ | ||
+ | ==== Effective abstraction ==== | ||
+ | Here's a few ideas for cleaning up your tests with abstraction. Remember, in all of this, the name of the game is // | ||
+ | * Do as little packaging in your setup methods as possible. Instead, move the heavy lifting of packaging to dedicated private methods, like this: | ||
+ | < | ||
+ | void init() { | ||
+ | criteria = createMockCriteria(0); | ||
+ | } | ||
+ | |||
+ | private FibonacciCriteria createMockCriteria(int number) | ||
+ | { | ||
+ | FibonacciCriteria criteria = mock(FibonacciCriteria.class); | ||
+ | when(criteria.getNumber()).thenReturn(number); | ||
+ | return criteria; | ||
+ | } | ||
+ | </ | ||
+ | * If you need more than one parameter in your setup method, consider using a DTO with a builder to increase comprehension. | ||
+ | < | ||
+ | void init() { | ||
+ | criteria = createMockCriteria(FibonacciCriteriaMockDTO | ||
+ | .builder() | ||
+ | .number(0) | ||
+ | .build()); | ||
+ | } | ||
+ | |||
+ | private FibonacciCriteria createMockCriteria(FibonacciCriteriaMockDTO fibonacciCriteriaMockDTO) | ||
+ | { | ||
+ | FibonacciCriteria criteria = mock(FibonacciCriteria.class); | ||
+ | when(criteria.getNumber()).thenReturn(fibonacciCriteriaMockDTO.number); | ||
+ | return criteria; | ||
+ | } | ||
+ | |||
+ | @Builder | ||
+ | class FibonacciCriteriaMockDTO { | ||
+ | int number; | ||
+ | } | ||
+ | </ | ||
+ | * Don't abstract away the actual calling of the SUT. After all, that's the most important part. | ||
+ | * Writing helper assertion methods can be helpful, but they can also confuse readers if not done right. Follow these tips to write a good helper assert. | ||
+ | * Your assertion should take two arguments at most (excluding housekeeping arguments like " | ||
+ | * The first argument should be the packaged output, and the second argument should be the primitive to test. | ||
+ | * The name of the helper should begin with " | ||
+ | * All asserts in the helper assertion method should be conceptually related. If possible, use just one. | ||
+ | < | ||
+ | @Test | ||
+ | void thenResultIs0() { | ||
+ | assertResult(response, | ||
+ | } | ||
+ | |||
+ | private void assertResult(FibonacciResponse response, int expectedResult) | ||
+ | { | ||
+ | int result = response.getResult(); | ||
+ | assertEquals(expectedResult, | ||
+ | } | ||
+ | </ |
technology/opinions/abiasedguidetounittests.1630861020.txt.gz · Last modified: 2021/09/05 16:57 by Owen Mellema