Concepts: Validation Rules

TLDR: the most important part of a DNA is the Validation Rules (rules of the game), which determine which actions are valid and which aren't. If someone tries to break them, an immune response is triggered.

Validation Rules

Validation Rules are probably the most important mechanism in Holochain. They are encoded in the DNA, and define what is valid and what is not valid in the context of it.

You can think of it this way: if we all agree that we are playing soccer, then when someone touches the ball with their hand, we all know that that's invalid in the context of this game. But if we are playing basketball, that's obviously allowed.

Validation Rules need to be deterministic. In short this means that they need to give back the same result (valid or invalid) for any given element, no matter when they are run, by whom, or in which circumstances.

The Immune System

But! What happens if someone breaks the rules? How does Holochain maintain data integrity if some malicious agent publishes bad data to the network?

Try it!

Here you can see a network of 10 agents with 1 malicious node (marked with 😈).

In this DNA there is only one validation rule: anyone can create posts, but only the author can update their own posts.

In this scenario our malicious agent will try to update a post created by another agent, which is not allowed. Let's see what happens when you click "Run".

You can refresh the page and run it as many times as you want, and you can also enable the "Step by Step" mode to have a closer look.

So! If you look closely, all the connections between the malicious agents and the other nodes have been closed. Effectively the malicious agent has been booted out of the network, and the other agents won't talk to it again.

Here is a brief description of what has happened:

The malicious agent was able to update its own chain without any problems (after all, they control the code that is running on their computer).
Then they published the update to the DHT, to random nodes that are called "validators".
When the validators received that publication, before accepting it they have run the validation rules for our DNA.
In this case the update was not valid, and since we all agreed to play by the same rules just by being present in this DNA, we know that the malicious agent has cheated.
We then proceed to sound the alarm, creating warrants for everyone in the network to make sure they know about the cheating.
The nodes that receive those warrants run the validation rules themselves to know for sure, and close any connection they had with the cheating agent and emit new warrants.

But what happens with 51% of malicious nodes?

But that surely cannot sustain a 51% attack... can it?

In this other scenario, we have a network of 10 nodes, with only 4 of them being honest nodes. This network has the same rules as the last one: one agent cannot update another agent's post.

When you run this scenario, one of the malicious agents is going to update a post from another agent. What the other malicious agents will do is pretend that that action was valid, trying to convince the honest nodes that the update was valid in reality.

Try for yourself: click "Run" and see what happens.

The honest nodes have been able to separate themselves into their own network, effectively excluding the other ones from participating.

How did the honest nodes know who was cheating? Every time an agent validates an element, they compare the result of their own validation with the validation done by other agents. If the results are different, that must mean that the other validator is trying to cheat as well (because remember, validation rules are deterministic!), so we close the connection as well.

Keep in mind that this representation and scenario are a bit simplistic in favor of clarity: in reality, you will have different options on what to do about cheating agents, and how your DNA wants to react to different offenses in the network.

Caught a mistake or want to contribute to the documentation? Edit this page on GitHub!