I’m a big fan of Fauna. However, I haven’t been fully comfortable relying on it as an app’s primary database.
I’ve experienced rare but repeated instances where transactions have failed because of a service outage. A 15-minute outage can cause hard-to-fix problems, particularly when later events rely on the transactions that failed during the outage.
If I were to rely on Fauna as my primary database going forward, what would be the best thing to do in cases like this (short of taking down my entire service)?
I already wrap write queries in a retry function to work around “socket hang up” errors, potentially related to node-fetch. So, a solution I’ve been mulling over is to extend this so that transactions that have failed X times get sent somewhere (not Fauna) to be stored, so I can later insert them when Fauna is back up.
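The retry-then-park idea could be sketched roughly like this. Everything here is hypothetical (the `retryWrite` and `deadLetters` names are made up, and the dead-letter store would need to be something durable rather than an in-memory array), not part of any Fauna client API:

```javascript
// Sketch only: retry a write, and once retries are exhausted park the
// failed intent in a "dead letter" store outside Fauna for later replay.
const deadLetters = [];

async function retryWrite(runQuery, { retries = 3, delayMs = 100 } = {}) {
  for (let attempt = 1; attempt <= retries; attempt++) {
    try {
      return await runQuery();
    } catch (err) {
      if (attempt === retries) {
        // Out of retries: persist the intent somewhere that is NOT Fauna,
        // so it can be replayed once the service recovers.
        deadLetters.push({ failedAt: Date.now(), error: String(err) });
        throw err;
      }
      // Simple linear backoff between attempts.
      await new Promise((resolve) => setTimeout(resolve, delayMs * attempt));
    }
  }
}
```

A replay job would then read `deadLetters` (or its durable equivalent) and re-run the stored writes when the status page is green again.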
Hey Ewan. No, just the ones that correlate with the outages on the status page. Last month’s outage caused a few problems, so I’m just thinking about how I could minimise any side effects of similar events in the future.
You could address those issues by treating writes as eventually applied rather than applied in real time.
Basically, what you describe isn’t Fauna-specific; any online service has such issues, AWS included.
What people usually do is to use an event queue instead of hitting services (api, etc.) directly. Everything goes to the queue and services consume the queue.
This way, in the case of an outage, the queue acts as the source of truth for every service.
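To make the pattern concrete, here’s a minimal in-memory sketch of that idea. In production the queue would be a real broker (SQS, Kafka, etc.) and the names here (`WriteQueue`, `applyWrite`, `drain`) are all hypothetical:

```javascript
// Sketch only: an in-memory event queue. Services enqueue write intents
// instead of hitting the database directly; a consumer drains the queue
// and applies each write, keeping anything that still fails for a later pass.
class WriteQueue {
  constructor(applyWrite) {
    this.applyWrite = applyWrite; // e.g. a function that runs the Fauna query
    this.pending = [];
  }

  enqueue(event) {
    // The queue, not the database, holds the source of truth for this write.
    this.pending.push(event);
  }

  // Attempt every pending write; failed events stay queued for the next drain.
  async drain() {
    const remaining = [];
    for (const event of this.pending) {
      try {
        await this.applyWrite(event);
      } catch {
        remaining.push(event);
      }
    }
    this.pending = remaining;
  }
}
```

During an outage the consumer’s drain pass simply fails and the events sit in the queue; once the database is back, the next drain applies them in order, which is exactly the “eventually applied” behaviour described above.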
This is more of a design (architectural) problem; it’s quite common with serverless architectures.
I don’t have any hands-on experience implementing a service queue myself.
I know AWS SQS (Amazon Simple Queue Service) is famous for this. I’m not aware of anything tied to FaunaDB itself; these queues are usually pretty agnostic. SQS is probably a good choice, but you’ll need to dig in deeper and see for yourself whether it fits your needs.