Make Your Data FABulous
The CAP theorem is widely known for distributed systems, but it’s not the only tradeoff you should be aware of. For datastores there is also the FAB theory and just like with the CAP theorem you can only pick two: Fast: Results are fast enough so that people can have a seamless interaction. Accurate: Answers are accurate and don’t have a margin of error. Big: Dozens or hundreds of systems are involved in calculating the result. Most SQL databases are in the FA space whereas Hadoop and related systems are generally AB systems. A system optimized for FB is Elasticsearch for example. While Fast and Big are relatively easy to understand, Accurate is a bit harder to picture. This talk shows some concrete examples of accuracy tradeoffs Elasticsearch has taken and how to optimize them for your usecase.