DEVELOPING A VOICE USER INTERFACE IN JUST 24 HOURS?
Together with 15 other teams we took on this challenge at the “TALK TO ME, BERLIN!” hackathon: We developed a voice user interface (VUI) for rental bikes that guides tourists through Berlin, and mainly focused on its personality. And this approach payed off: Our voice assistant “Alex, the bear” won the Best Character award.
BERLIN NEEDS A BIKING TOUR GUIDE
Before the event, we—a team of designers and developers—generated ideas and worked out which of these ideas would have the greatest potential. Rental bikes equipped with a voice assistant that enables visitors to start off on a tour through the city without preparations? Sounds tempting, especially for active and tech-savvy tourists who want to explore Berlin without constantly having to get their phones out.
A VUI WITH PERSONALITY?
What could be better suited for Berlin than a bear, the city’s heraldic animal? This idea formed the basis for “Alex, the bear”, and a large part of its personality was quite obvious: laid back, with a dry sense of humor and the brash slang that is typical of Berlin.
But how did we transport this personality into a VUI? First, we defined age and gender and selected a fitting male voice. The tonality was outlined using adjectives that laid the basis for the dialogues. The texts enabled us to define Alex’s reactions to certain situations—and all this individuality was the reason for Alex receiving the award.
The conversation was created as a Google Action in the graphic dialogflow editor and is accessed via the Google Assistant. A Google Action is comparable to an Alexa Skill.
TEST ALEX!
Say “hello” to start Alex.
MORE THAN JUST A VOICE ASSISTANT
Finally, we brought Alex to life as an IoT prototype: Alex guides tourists through Berlin using speech and tells them more about the subjects that interest them. From the typical sights to street art all the way to a tour of Berlin’s iconic late-night corner shops—everyone will find something to suit their interests. Moreover, an integrated camera detects sights while the system points them out to the users.