Top "Fault-tolerance" questions

Fault tolerance refers to a system's capability to isolate, compensate for, and recover from failures with minimal impact on the end user.

Compiling an application for use in highly radioactive environments

We are compiling an embedded C++ application that is deployed in a shielded device in an environment bombarded with ionizing …

c++ c gcc embedded fault-tolerance
Do I absolutely need a minimum of 3 nodes/servers for a Cassandra cluster or will 2 suffice?

Surely one can run a single-node cluster, but I'd like some level of fault tolerance. At present I can afford …

nosql cassandra fault-tolerance
Hystrix: Custom circuit breaker and recovery logic

I just read the Hystrix guide and am trying to wrap my head around how the default circuit breaker and …

java fault-tolerance hystrix
Fault Tolerance in MapReduce

I was reading about Hadoop and how fault tolerant it is. I read about HDFS and how failure of …

mapreduce distributed-computing fault-tolerance
Why is C++ template use not recommended in a space/radiated environment?

By reading this question, I understood, for instance, why dynamic allocation or exceptions are not recommended in environments where radiation …

c++ templates embedded fault-tolerance
Akka Actor not terminating if an exception is thrown

I am currently trying to get started with Akka and I am facing a weird problem. I've got the following …

scala routing actor akka fault-tolerance
Scala + Akka: How to develop a Multi-Machine Highly Available Cluster

We're developing a server system in Scala + Akka for a game that will serve clients in Android, iPhone, and Second …

scala high-availability fault-tolerance akka
Spring batch: Fault Tolerant

I have the following Step: return stepBuilderFactory.get("billStep") .allowStartIfComplete(true) .chunk(20000) .reader(billReader) .processor(billProcessor) .faultTolerant() .skipLimit(Integer.MAX_…

spring spring-batch fault-tolerance
quartz jobDetail requestRecovery

The documentation for the JobDetail.requestsRecovery property states the following: "Instructs the Scheduler whether or not the Job should be re-executed …

quartz-scheduler fault-tolerance
Are Erlang/OTP messages reliable? Can messages be duplicated?

Long version: I'm new to Erlang, and considering using it for a scalable architecture. I've found many proponents of the …

transactions erlang messaging reliability fault-tolerance