README
twilio-interactive-flow
This library makes it easy to build a phone tree/IVR system using twilio.
Note: although this library has good test coverage and I've had no problems using it in production, it's still pre-1.0 and the API may change as call session functionality is added.
Usage
The library exports one primary function, which takes all the states in your call system and returns an express app that can handle all the HTTP requests needed to interact with twilio to make your call system work.
const twilioIvr = require("twilio-interactive-flow").default;
// Define an array of your states. See below for how states work.
const states = [];
const config = {
  twilio: { authToken: "...." },
  // ...
};
const app = twilioIvr(states, config);
// Start up your server to take requests from twilio.
app.listen(3000);
Config Options
The config object allows the following keys:
twilio (required): an object containing twilio settings.
  authToken (required): your twilio auth token.
  validate (optional, default: true): a boolean indicating whether the express app should reject incoming requests that aren't coming from twilio. For security, set this to true in production. However, when making requests to your own app during development or testing, it's very convenient to set it to false. (Note: the app verifies that the request is coming from twilio by checking whether it's signed with your auth token, which only you and twilio are supposed to know.)
staticFiles (optional): an object configuring if/how the returned express app should serve static files to twilio. Serving audio files in a cache-friendly way is a common need in IVR systems, and this library can help with that. See the static files section for the keys allowed here.
urlFor (required): an object that helps the library generate urls.
  host (required): a function that must return the host that should be used by default in absolute urls generated by the library's urlFor function. Most likely, your function should just return a constant string. In case you need to do something more sophisticated, this function is passed express' request object. Note: be careful here about using the HTTP Host header to generate the return value, as that creates a security risk.
  scheme (optional): a function that returns the scheme to use by default in absolute urls generated by the library's urlFor function. If this is not provided, req.protocol will be used.
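As a rough illustration of the keys above, a config object might look like the sketch below. The host name ivr.example.com and the TWILIO_AUTH_TOKEN and NODE_ENV environment variables are assumptions made for this example; the library does not require them.

```javascript
// Sketch of a config object using the keys described above.
// Assumptions: the auth token is read from a TWILIO_AUTH_TOKEN environment
// variable, and "ivr.example.com" stands in for your real host.
const isProduction = process.env.NODE_ENV === "production";

const config = {
  twilio: {
    authToken: process.env.TWILIO_AUTH_TOKEN || "....",
    // Validate twilio's request signature in production; skip it elsewhere
    // so that hand-made requests work during development and testing.
    validate: isProduction
  },
  urlFor: {
    // Return a constant instead of trusting the request's Host header.
    host: (req) => "ivr.example.com",
    // Optional; when omitted, req.protocol is used.
    scheme: () => "https"
  }
};
```

Returning a constant from host sidesteps the Host header risk mentioned above, at the cost of needing a different value per deployment environment.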
States
The concept of a "state", in the finite state machine sense of the term, is at the heart of this library. It allows you to describe your call system as a set of states, each of which can transition to other states depending on input from the caller (or on information from outside the call, like the time of day etc).
Anatomy of a State
A state is just an object (a POJO is fine). This library defines some properties/methods that a state can have, which affect how it's handled (e.g. how the library will pass it input). The allowed properties are:
twimlFor(urlFor, inputData?) (optional): a function that returns the Twiml used to "render" the state to the caller. For example, the caller might be on a state that asks them to choose from a list of options. To present those options to the caller, your application has to provide some Twiml (probably using <Say> or <Play>) to read out the options. This function would be responsible for returning that Twiml. States with a twimlFor property are called renderable states.
transitionOut(inputData?) (optional): a function that's called to determine the next state. States with a transitionOut function are called branching states. A state's transitionOut function is usually called in response to caller input, or to a new call coming in, and receives that input data. However, it can also be called indirectly; see below. It returns the next state, or a promise for the next state.
backgroundTrigger(urlFor, inputData?) (optional): a function called just before the state is rendered (backgroundTrigger is only available on renderable states). This function can be used to kick off background operations that should happen as a result of reaching this state. Note: this function does not block rendering the state, so twimlFor should not assume that anything backgroundTrigger does has been completed at render time. (backgroundTrigger may be given the ability to block, or to be used on non-renderable states, in the future.) States with a backgroundTrigger function are called asynchronous states.
uri (optional): most states will not have a uri property; this property is used primarily on your system's "entry state" (i.e., the state that twilio will use to start a call, likely through the incoming call webhook). However, if there are other states that you need to be able to "jump to" directly (i.e., point twilio to, and have it continue an existing call from there), those must also have a uri. The uri property should hold a relative uri string that will be used by the library to create an express POST listener that, when requested, consults the state to figure out how to respond. States with a uri property are called routable states.
processTransitionUri (optional): a relative uri where caller input data should be sent; data sent to this uri will be passed to the state's transitionOut method to determine the next state. (The processTransitionUri only applies to states that are branching and renderable.) Like uri, the uri given here is turned into an express POST listener by the library. A state's twimlFor() method should render twiml that instructs twilio to send the relevant user input data to the processTransitionUri (see examples below). States with a processTransitionUri property, which are also renderable and branching states, are called normal states, as they tend to be the most common state type.
isEndState (optional): this property, if present, can only have one value: true. It's used to mark a state as an end state of your call (see below).
name (optional): a string that uniquely identifies the state (among all your states). Will be used in the logs for easier debugging.
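To make the terminology concrete, here is a small sketch that reports which of the kinds defined above (renderable, branching, asynchronous, routable, normal, end) a state object falls under. The describeState helper and the MENU state are hypothetical and exist only for this illustration; they are not part of the library.

```javascript
// Hypothetical helper (not part of the library): lists the state "kinds"
// defined above that a given state object belongs to.
function describeState(state) {
  const renderable = typeof state.twimlFor === "function";
  const branching = typeof state.transitionOut === "function";
  const kinds = [];
  if (renderable) kinds.push("renderable");
  if (branching) kinds.push("branching");
  if (typeof state.backgroundTrigger === "function") kinds.push("asynchronous");
  if (typeof state.uri === "string") kinds.push("routable");
  if (renderable && branching && typeof state.processTransitionUri === "string") {
    kinds.push("normal");
  }
  if (state.isEndState === true) kinds.push("end");
  return kinds;
}

// A made-up state with every property a routable normal state needs.
const menu = {
  name: "MENU",
  uri: "/menu",
  processTransitionUri: "/menu-transition-out",
  twimlFor() { return "<Response></Response>"; },
  transitionOut() { return menu; }
};

console.log(describeState(menu)); // [ 'renderable', 'branching', 'routable', 'normal' ]
```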
Valid States
As you can see, almost all properties on a state are optional, and many of the properties can be used together to create states with interesting behaviors. However, not all combinations are valid. Below are all the valid combinations, with an example of where you might use each:
End States (Routable or Not)
An End State is a renderable state that doesn't branch to anywhere else. As its name would suggest, an end state is (almost?) always the last state in your call. Here's an example end state that just hangs up, which is common behavior:
var endState = {
  name: "END_STATE",
  isEndState: true,
  twimlFor() {
    // If you don't want to build raw XML, you can also return a TwimlResponse
    // to simplify this. See https://twilio.github.io/twilio-node/
    return `<?xml version="1.0" encoding="UTF-8"?>
      <Response>
        <Hangup />
      </Response>`;
  }
}
The end state above isn't routable (i.e., other states can transition to it, but it doesn't have an HTTP endpoint that twilio can request directly to render it). That's usually what you want in your end state. If, however, you wanted to be able to hijack a running call to render an error message after some external condition had failed, you could have a routable end state like this:
var endStateRoutable = {
  name: "UNKNOWN_ERROR",
  uri: "/unknown-error",
  isEndState: true,
  twimlFor() {
    return `<?xml version="1.0" encoding="UTF-8"?>
      <Response>
        <Say>An unknown error has occurred. Please leave us a message after the beep.</Say>
        <Record />
      </Response>`;
  }
}
Once registered with the library, this state would result in the creation of a POST /unknown-error endpoint that you could redirect the running call to, and that would render the above Twiml.
Normal States (Routable or Not)
Most states in your system will probably be normal states, as they have all the machinery for playing something to the user, gathering input, and deciding what to do based on that input.
Below is an example of two normal states, one routable and one not, that, together with the end state above, could form a simple IVR system:
// Note the `uri` in the (routable) entry state below, which will produce a
// POST /incoming-call endpoint that we can use as our twilio webhook handler.
var entryState = {
  name: "CALL_RECEIVED",
  uri: "/incoming-call",
  processTransitionUri: "/incoming-call-transition-out",
  twimlFor() {
    // Below, we use <Gather>'s action attribute (and the <Redirect>, for the case
    // that the Gather times out) to send the user input to the `processTransitionUri`.
    return `<?xml version="1.0" encoding="UTF-8"?>
      <Response>
        <Gather action="${entryState.processTransitionUri}">
          <Say>Welcome to Foo Corp! To hear our lunch specials press one. For our hours, press two.</Say>
        </Gather>
        <Redirect method="POST">${entryState.processTransitionUri}</Redirect>
      </Response>`;
  },
  // Process the input to return the next state.
  // On no/invalid input, play the same state so the caller can try again.
  transitionOut(inputData) {
    let inputDigit = (inputData.Digits || [])[0];
    switch(inputDigit) {
      case "1":
        return lunchSpecialsState;
      case "2":
        return hoursState;
      default:
        return entryState;
    }
  }
}
// By contrast, here's a normal state that's not routable, meaning it has no `uri`.
// It can be pointed to by other states' transitionOut, but never rendered directly by twilio.
var lunchSpecialsState = {
  name: "LUNCH_SPECIALS",
  processTransitionUri: "/lunch-specials-transition-out",
  twimlFor() {
    return `<?xml version="1.0" encoding="UTF-8"?>
      <Response>
        <Gather action="${lunchSpecialsState.processTransitionUri}">
          <Say>Our lunch special is a hamburger.</Say>
          <Say>If you'd now like to hear our hours, press one.</Say>
        </Gather>
        <Redirect method="POST">${lunchSpecialsState.processTransitionUri}</Redirect>
      </Response>`;
  },
  transitionOut(inputData) {
    let inputDigit = (inputData.Digits || [])[0];
    // Play hours if the user enters one.
    // Otherwise, hang up (using our end state from earlier).
    return inputDigit === "1" ? hoursState : endState;
  }
}
var hoursState = {/* left as an exercise to the reader. */ };
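Both normal states above repeat the same Gather-plus-Redirect Twiml. One way to cut that duplication, sketched below, is a small helper that builds the document from a processTransitionUri and a list of prompts. The gatherTwiml and escapeXml helpers and the EXAMPLE_MENU state are hypothetical names introduced for this illustration; the library does not provide them.

```javascript
// Hypothetical helpers (not part of the library).

// Escape text so it is safe inside XML element content and attribute values.
function escapeXml(text) {
  return String(text)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Build a <Gather> that posts input to processTransitionUri, followed by a
// <Redirect> to the same uri for the case where the Gather times out.
function gatherTwiml(processTransitionUri, prompts) {
  const action = escapeXml(processTransitionUri);
  const says = prompts.map(p => `    <Say>${escapeXml(p)}</Say>`).join("\n");
  return `<?xml version="1.0" encoding="UTF-8"?>
<Response>
  <Gather action="${action}">
${says}
  </Gather>
  <Redirect method="POST">${action}</Redirect>
</Response>`;
}

// A made-up normal state that uses the helper.
const exampleState = {
  name: "EXAMPLE_MENU",
  processTransitionUri: "/example-menu-transition-out",
  twimlFor() {
    return gatherTwiml(exampleState.processTransitionUri, ["For our hours, press one."]);
  },
  transitionOut(inputData) {
    // Replays this state; a real state would branch on inputData.Digits.
    return exampleState;
  }
};
```

Escaping the prompt text also guards against prompts that contain characters such as & or <, which would otherwise produce invalid XML.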
Non-Renderable Branching States (Routable or Not)
So far, all the branching states (i.e. those with a transitionOut) that we've seen have been renderable. But, sometimes, you want to branch based on something other than a prompt you render to the caller. One example might be determining the next state based on the time of day:
var branchingState = {
  name: "CHECK_IF_MORNING",
  transitionOut() {
    return (new Date()).getHours() < 12 ? callFrontDesk : recordVoicemail;
  }
}
The above state isn't routable but, if you wanted this check to happen as the first thing when a call comes in, you could make it routable (by adding a uri member) and use it as the entry state/twilio webhook destination.
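Since transitionOut may return a promise for the next state, a non-renderable branching state can also branch on the result of an asynchronous check. The sketch below assumes a hypothetical isOfficeOpen() lookup and uses bare placeholder objects for the two target states; none of these names come from the library.

```javascript
// Hypothetical stand-in for a real asynchronous lookup (database, HTTP API, ...).
function isOfficeOpen() {
  return Promise.resolve(true);
}

// Placeholder targets; real states would also define twimlFor and so on.
const callFrontDesk = { name: "CALL_FRONT_DESK" };
const recordVoicemail = { name: "RECORD_VOICEMAIL" };

const checkIfOpenState = {
  name: "CHECK_IF_OPEN",
  // Returns a promise for the next state rather than the state itself.
  transitionOut() {
    return isOfficeOpen().then(open => (open ? callFrontDesk : recordVoicemail));
  }
};
```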
Asynchronous States (Routable or Not)
Asynchronous states are used when you need to kick something off in the background before rendering the state. Below is an example of an asynchronous state that performs a slow lookup of the weather while the user is told to wait. Then, if the lookup succeeds, it redirects the user to a state that reads the weather; otherwise, it redirects to an error state.
var lookupWeatherState = {
  name: "LOOKUP_WEATHER",
  twimlFor() {
    return `<?xml version="1.0" encoding="UTF-8"?>
      <Response>
        <Say>We're looking up the weather. Please hold on.</Say>
      </Response>`;
  },
  backgroundTrigger(urlFor) {
    lookupWeather().then(weather => {
      // save weather in the call session, and use twilio's rest api
      // to redirect this call to a state that will play the weather.
    }, error => {
      // the lookup failed, so use twilio's rest api to redirect the call
      // to (e.g.) the routable UNKNOWN_ERROR end state we made earlier.
    });
  }
};
The above asynchronous state isn't routable, but, like with the example non-renderable branching state, it would be easy to make it routable if you wanted to use it as the entry state to your call.
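The comments in the example above leave the redirect itself unspecified. A minimal sketch of that step follows, assuming `client` is a twilio-node REST client whose calls(callSid).update(...) method returns a promise. The redirectCall helper is a hypothetical name, not something this library exports, and the client is passed in as an argument so that the helper can be exercised with a stub.

```javascript
// Hypothetical helper (not part of this library): points a live call at a
// routable state's absolute url using twilio's REST API.
// Assumes `client` is a twilio-node REST client, e.g. the value returned by
// require("twilio")(accountSid, authToken).
function redirectCall(client, callSid, absoluteUrl) {
  // Routable states are exposed as POST listeners, so ask twilio to POST.
  return client.calls(callSid).update({ url: absoluteUrl, method: "POST" });
}
```

Inside backgroundTrigger, the error callback could then call something like redirectCall(client, inputData.CallSid, "https://ivr.example.com/unknown-error"). This assumes that inputData carries the CallSid parameter twilio sends with its requests, and that the example host is replaced with your own.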
Other Features
Static files and urlFor
It's very likely that, at some point, you'll want to play a recording to the caller, which you can do by pointing to an audio file in your Twiml. When twilio requests your audio file, you'll want to ensure that your HTTP response includes headers telling twilio to cache the file; if you don't, twilio will have to download and transcode the audio file every time it's played, and your callers will experience a delay before the audio starts. In addition, if/when you update your audio with a new recording, you'll want to force twilio to pick up and use the new version right away.
The common way to handle both these needs is to set far-future Expires/Cache-Control headers on your static file responses, and then bust the cache by updating the url when the file changes. This technique is explained here.
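For reference, setting far-future caching headers by hand could look roughly like the express-style middleware below. The farFutureCacheHeaders name and the one-year lifetime are assumptions chosen for this sketch; this is not the library's own implementation.

```javascript
// Express-style middleware sketch: marks every response as cacheable for a year.
// Only safe when the url changes whenever the file's contents change.
const ONE_YEAR_IN_SECONDS = 365 * 24 * 60 * 60;

function farFutureCacheHeaders(req, res, next) {
  res.setHeader("Cache-Control", `public, max-age=${ONE_YEAR_IN_SECONDS}`);
  res.setHeader(
    "Expires",
    new Date(Date.now() + ONE_YEAR_IN_SECONDS * 1000).toUTCString()
  );
  next();
}
```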
The twilio-interactive-flow library makes it really easy to do this. First, you provide a few options in your static files configuration (described below). Then, use the urlFor function, passed by the library as the first argument to twimlFor and backgroundTrigger, to generate urls for your static files that automatically have a query parameter whose value will change every time the file is changed, to bust the cache. These are called "fingerprinted urls".
For example:
var playAudioState = {
  twimlFor(urlFor) {
    return `<?xml version="1.0" encoding="UTF-8"?>
      <Response>
        <Play>${ urlFor("/intro.mp3") }</Play>
      </Response>`;
  }
};
Above, the urlFor call will generate a fingerprinted url like /intro.mp3?v=16acf2382173d.... Note: the argument to the urlFor function is the public url to your static file (not its path on disk), just without the version parameter.
Note: cache busting using a query parameter is generally not as reliable as cache-busting by changing the non-query part of the URL, because not all caches include the query parameters in their cache keys. However, twilio's cache does, and that's the only cache we care about.
The urlFor function optionally takes other options, including whether to make the generated url absolute (which twilio needs in some cases) and an object of query parameters to add to the generated url.
Below are the options you can provide under the staticFiles key of the config object, to get the functionality described above:
path or fingerprintUrl. If you're using the library's static file handling at all, you must provide one and only one of these:
  path: an absolute path to the folder on disk containing your static files. If this is provided, the library will automatically scan that directory when your app is started and generate a fingerprint for each file, which will then be used to generate urls by the urlFor function.
  fingerprintUrl: a function that takes an unfingerprinted (root-relative) url for a static file, and returns the fingerprinted version. If provided, this function will be used by the urlFor function to generate the fingerprinted urls. Only use this option if you can't use path for some reason. Possible reasons: your static files are not located on disk, but are retrieved over the network; or, your static files change while the app is running, and you need the fingerprints to be regenerated without restarting the app.
mountPath (optional, empty by default): a path segment that will be used as a prefix in the urls for your static files. For example, if you have a static file called intro.mp3 and you want it served at https://example.com/static/intro.mp3, you'd set mountPath to /static. If this is provided, it should be a string starting with a /.
middleware (optional): an express middleware that will be used to actually serve the static files and set the appropriate caching headers. By default, the library automatically handles this if the path option is used (since it knows where to find the files). But, if fingerprintUrl is used instead, the library defaults to calling next() for requests under the static file mount path, allowing you to handle those requests outside of the library. In either case, though, you can use this option to provide a custom middleware to override the library's behavior. Note: in your middleware, req.url will refer to the fingerprinted url with the static files mount path removed. Your middleware will be called to handle all requests under the static files mount path, including the hold music endpoint (if you set that endpoint up). If you'd like to use the default middleware to serve the hold music endpoint's contents, see here.
holdMusic (optional): an object containing keys specifically related to setting up the hold music endpoint; if not provided, no hold music endpoint will be created.
  fileRelativeUri (required, if the holdMusic object is present): the URI of your hold music audio file, relative to/excluding the base URI where all your static files are served from. So, if your static files are served out of https://example.com/static, and your hold music file is at https://example.com/static/hold.mp3, you'd set this option to hold.mp3 (or ./hold.mp3).
  endpoint (optional, defaults to /hold-music): the unfingerprinted uri to use for the hold music endpoint. This uri will be nested under the static files mount path if one is provided. I.e., if the mountPath is /static and the endpoint is /hold-music, the full, unfingerprinted hold music uri will be /static/hold-music.
  twimlFor(urlFor) (optional): a function you can provide to override the built-in logic for generating the hold music endpoint's Twiml.
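As an illustration of the fingerprintUrl option, the sketch below derives the fingerprint from an MD5 hash of the file's current contents, so the url changes whenever the contents do. The in-memory fileContents map is a hypothetical stand-in for files fetched over the network, and the v query parameter simply mirrors the example url shown earlier; neither is prescribed by the library.

```javascript
const crypto = require("crypto");

// Hypothetical in-memory store standing in for files retrieved over the network.
const fileContents = new Map([["/intro.mp3", Buffer.from("first recording")]]);

// Takes an unfingerprinted, root-relative url and returns the fingerprinted one.
function fingerprintUrl(url) {
  const contents = fileContents.get(url);
  if (contents === undefined) {
    throw new Error(`No static file registered for ${url}`);
  }
  // Hash the current contents so the url changes whenever the file does.
  const hash = crypto.createHash("md5").update(contents).digest("hex");
  return `${url}?v=${hash}`;
}

// Provide fingerprintUrl instead of path; the two are mutually exclusive.
const staticFiles = { fingerprintUrl };
```

Because the hash is recomputed on every call, a replaced recording gets a new url without restarting the app. A real implementation would probably cache the hash per file rather than rehash on each request.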
Hold Music
[Coming Soon]
Call Sessions
[Coming Soon]