Skip to the content.

What you need to know to build a serverless communication system

How our startup designed an event-driven architecture to automate e-mails in a cost effective way

June 9, 2020

As part of our product, LegUp communicates with families and day care providers. Automating this workflow is an important part of our ability to scale. As a scrappy startup, we wanted a solution that allows us to build something scalable at an affordable price. We decided to use AWS’s Simple Email Service (SES) to help manage our e-mails. To provide scalability and avoid throttling when sending bulk e-mails, we built an event-driven system. This system integrates with Simple Queue Service (SQS) and Simple Notification Service (SNS). It is managed through a Lambda function.

If you’ve tried using SQS or other event-bases systems, you’ve likely encountered the problem of idempotency. The same message can be delivered multiple times.

If you’ve tried using SQS or other event-bases systems, you’ve likely encountered the problem of idempotency. The same message can be delivered multiple times. Additionally SNS can return details of message receipt before SES returns. This gives you a lot to manage to ensure that you send outgoing mails once and only once! In this blog post, I’ll explain howLegUp set this up to deal with these issues and provide some Typescript code snippets.

Architecture

LegUp's Serverless architecture for processing e-mails

Our architecture consists of several AWS components:

Let’s talk through how these components interact as we walkthrough the flow of sending out e-mails.

Initial API request

When our primary API receives a request, it may result in a need to send hundreds or even thousands of e-mails. We don’t want to block while it sends out mails. Even if we did try to send mail from this primary worker, SES limits the number of e-mails we can send per second. Instead we hand this task off to another process, spacing these requests to avoid throttling. To set this up, we post messages with the content of our mail into SQS. We take the contents of the mail (recipient, subject, and message) and save these as the MessageBody. Because we manage different types of communications, we add metadata to specify this as a plain-text e-mail message.We do this using custom MessageAttributes, channel and type as shown in this code snippet.

export async function sendEmail(to: string, subject: string, message: string): Promise<boolean>{
  try {
    const msg = {
      MessageBody: JSON.stringify({to, subject, message}),
      DelaySeconds: ((delay++) % 900),
      MessageAttributes: {
        channel: {
          DataType: “String”,
          StringValue: “email”
        },
        type: {
          DataType: “String”,
          StringValue: “plaintext”
        }
      },
      QueueUrl: process.env.emailQueueUrl,
    };
    const data = awaitSQS.sendMessage(msg).promise();
    success = !!data;
  } catch (e) {
    success = false;
  }
  return success;
}

Note the use of the global delay variable passed in to DelaySeconds. This allows us to queue a large number of messages and delay sending them to avoid per-second limits.

Queue Set Up

Setting up the queue is simple and can be done in the AWS Management Console.

Once you’ve created the queue you’ll need to create a Lambda function to processes messages. Start by creating an empty Lambda function to get its ARN. Once you’ve done that, you can add a trigger for the Lambda function by following these steps:

LegUp Communications System

LegUp Communications System

As noted, we set up our Queue as a standard queue which provides at-least-once message delivery. It is possible that our Lambda function will receive the same SQS message more than once. We don’t want e-mail recipients to receive duplicate messages. We need some level of idempotency within our handling code. To achieve this, we use a table within a Postgres database which allows us to look up details of this message. This table of sent messages will also come in handy when we process the recipient’s delivery notification.

This atomic operation lets us either add a new entry into our database or update it if it already exists.

The table definition is as follows:

CREATE TABLE IF NOT EXISTS messages(
  email_id bigserial UNIQUE, — email ID
  message_id VARCHAR(128), — ID of the message, returned from SES
  recipient VARCHAR(128), — e-mail address sent to
  sent_time timestamptz, — sent timestamp from mail
  delivered_time timestamptz, — delivery timestamp from recipient (if delivered properly)
  delivered boolean, — set to true if the message was delivered
  msg_queue_id VARCHAR(128) UNIQUE, — Message Queue ID of add (to check for duplicates)
  updated boolean, — Auxiliary column indicating whether an entry was added or updated
  created_at timestamptz DEFAULT now() NOT NULL
);

The msg_queue_id and updated fields help us achieve idempotency when handling SQS messages.

When we see a new message, the first thing we do is attempt to add an entry into our DB using the new queue ID. We use an upsert to achieve this. This atomic operation lets us either add a new entry into our database or update it if it already exists. Here is the SQL statement to achieve this:

INSERT INTO messages(message_id, recipient, msg_queue_id, updated) VALUES ($1, $2, $3, false) ON CONFLICT (msg_queue_id) DO UPDATE SET message_id = EXCLUDED.message_id, recipient = EXCLUDED.recipient, updated = true RETURNING updated, email_id;

The updated field indicates whether we created a new entry or updated an existing one. If it was an update, we should stop processing this message. Otherwise we’ll post a new message to SES. After we call SES, we’ll get a message_id and can call the same SQL query as above to set this messageID.

SNS integration and notifications

After we’ve sent an e-mail out, we’re interested in knowing if it was delivered or bounced. We use SNS to receive a notification after the recipient’s mail server has processed our message. We reuse the LegUpCommunications System to process this receipt. To do this, we need to make three sets of changes:

  1. First, we need to create the notification topic in SNS.
  1. Second, we need to set our Lambda function as a subscriber to this topic
  1. Third, we need to configure SES to trigger these notifications

Our notification includes the message ID (the return result from the call to SES), along with delivery details. But order is not guaranteed. It’s possible to receive an SNS notification before the call to SES has returned. This makes it difficult to match based on this message ID!

To solve for this, we can use the email_id that was set in the database during the call to enforce idempotency. We made this call before the call to SES, so we know that we’ll have this value in the callback notification. But to get this, we need to set a custom e-mail header with this value. We named ours “X-Corrleation-ID.” To send a custom header, you’ll need to use the sendRawEmail function rather than the simpler sendEmail call. But it’s not too hard to set up as you can see from the following code snippet:

export async function sendEmail(email_id: string, from:string, to: string, subject: string, message: string): Promise<string | null> {
  const mailMessage = `From: ‘LegUp’<${from}>\nTo: ${to}\nX-Correlation-ID: ${email_id}\nSubject:${subject}\nContent-Type: text/plain; charset=”us-ascii”\nMIME-Version: 1.0\n\n${message}`;
  let messageId: string | null = null;
  try {
    const data = await new AWS.SES({apiVersion: “2010–12–01”}).sendRawEmail({
      RawMessage: { Data:Buffer.from(mailMessage)},
    }).promise();
    messageId = data.MessageId;
  } catch (e) {
    messageId = null;
  }
  return messageId;
}

Now, within the callback that handles the SNS notification, we look for this header and use that to correlate with the message that is saved in our database

// We need a message ID else we can’t do anything
if (!event || !event.mail ||!event.mail.messageId || !event.mail.headers) {
  return false;
}
// The headers are given as an array of name/value JSON objects — find the one with the correlation ID. This ID can be used to retrieve details or set information in the messages table in our database
const email_id: string | undefined = (event.mail.headers.find((h: {name: string, value: string}) => h.name ===”X-Correlation-ID”) || {}).value;
if (!email_id) {
  return false;
}
const sent_time = event.mail.timestamp;
let status: “delivered” |”bounced” | “complaint”;
if (event.notificationType ===”Delivery”) {
  status = “delivered”;
} else if (event.notificationType ===”Bounce”) {
  // Note you can also look at the event.bounce structure for more details about this bounce
  status = “bounced”;
} else if (event.notificationType ===”Complaint”) {
  // Note you can look at the event.complaint structure for more details about the complaint
  status = “complaint”;
}

Summary

There are several moving pieces to building a scalable automated e-mail system with AWS technologies. Idempotency, ensuring duplicate mails aren’t sent, and managing recipient mail server responses are possible with the use of some technologies beyond SES. I hope this guide gives you an idea on one way you can achieve this.

Have you built e-mail automation systems and have something you’d like to share? Please leave your comments and let me know about your experiences — I’m always looking to grow my knowledge!‍


Leave a comment

Comments are powered by Utterances. A free GitHub account is required. Please be respectful. No swearing or inflammatory language. No spam.
I reserve the right to delete any inappropriate comments.