Parrot Time - The Technical Secrets of LingoHut, Maybe

Parrot Time

The Thinking of Speaking

Issue #30 November / December 2017

Home

Extras

The Technical Secrets of LingoHut, Maybe

by Erik Zidowecki

November / December 2017 |

You have probably seen LingoHut by now, or at least got an idea of what it is like. It is slick. It is powerful. And I can guarantee you, the work that has gone into is phenomenal.

Most will be amazed by the volume of vocabulary there is for each language, along with recordings, and how much of that is spread across a dozen languages. That work alone is a cumulation of years and would correspond to several printed books.

As a programmer, I also look at websites and see how things were put together, think about what they had to overcome, and marvel at how smoothly it all works. Always remember that behind every learning platform is a devoted programmer.

Most of the time we hear from Kendal, who is the promoter and "face" of LingoHut, but she will always be sure to include her husband and partner, Philipp, in the credits, as he is the programmer who made it all possible.

Rather than ask Philipp all his secrets, I thought instead to talk about three of the larger issues which have to be considered and built in something like this. Just like how simple a wall switch appears when you flood the room with light, the ease of using the site hides the complexity of the system.

Words

The biggest part of LingoHut is its collection of vocabulary. They have compiled hundreds of words and phrases together to make up the lessons, and they need to have those for multiple languages. So how are they storing those?

Let us look at German as a basic example. The German course gives the person a word in both English and German, with an audio of the German pronunciation. This means that somehow, the English and the German words have to be stored as pairs, or at least in a way in which they can easily be paired together.

Undoubtedly the easiest way to do this is to have all the words stored in a file, with each line holding an English / German combination, separated by a character. For example, dog/Hund. Then the file could be loaded in, stored as an array (which is the computer equivalent to a list), and any pair could be accessed effortlessly.

This method is easy to make, simple to maintain, and straightforward to read. The only real drawback with it in terms of speed is that it takes a few extra microseconds to parse (split) the words and place them into the array. If you are willing to lose some of the file space and readability, you can store the data as a PHP array, already parsed. This means that whenever you need to load it, you just tell the compiler to load it. Boom. The job is done.

My main concern for storing it all like this is it is rather inflexible. From this list, I can only ever produce materials for English / German. If I wanted something like Italian / German, I would need a completely new file with all those words. Not bad, but what if you wanted several mixes, like French / German, Turkish / German, Swahili / German, etc. And what happens when you find you misspelled or mistranslated a German word? You would need to change numerous files.

If this is a concern to you as well, then we would have to consider a way to link words together across languages. It is not actually that hard. We simply assign each word a unique key of some kind (numbers, alphanumeric, etc.) and use that key for the same word in each language. For example, we could assign the key "1" to "dog". So in the German file, rather than looking for "dog", we look for "1" and find with it "Hund". In the Italian, we would find "cane". This is the way basic databases work, with keys linking data together.

With this method, you can now mix any language with any other language without creating a new file for each pairing.

There is another file solution I will mention only as advisement on what not to do. When creating data storage methods, there is always the question of balance between space and speed. The first way I listed, with just the words in a paired listing, was simple to read and relatively small, needing only space for the words. The second method, with storing the file as a premade array, made the file harder to read and increased the size, but made it faster to load into the program.

Several years ago, a storage method was introduced using files and "tags". A tag is a specific word that defines the data which comes next. HTML is an example of a widespread existing markup language (the "ML" in "HTML"). You would use a tag like "title", enclosed in brackets <> to tell what the title is. A slash would be included to end the data. How to Build a Fish

This new system was called "XML" (note the "ML" again) and it took the internet by storm. Everyone was putting all their information into it, in the hope it would make sharing the data easier. Because it was tag driven, everyone could within the code define their own tags, so it was conceivably a universal data structure.

However, there is a reason they teach courses on data structures, with the pros and cons for each laid out. That is because there is no "best" or "universal" way.

An associate of mine discovered this when he attempted to take all the word data we were using for a site and implement it this way. It was pretty. It was ordered. It was also bloated and slow.

See, to make data retrieval fast, you want to do as little work as possible. When you know that each line has two pieces of code and that the first one is an English word and the second is a German one, then you can instantly grab those pieces of data and store them in the right place. You do not need to figure out what they are.

However, in the tag system, before you can find the data, you need to find the tags defining where it is. Essentially, you need to find a tag, like and the other tag , then do the math to figure out which characters in between are to be extracted as the English word. You do the same for the German. But you also need to make sure they are for the same pair, so you have to make sure they both fall within the tags . Programming wise, this requires lots of comparison tests and some basic math for each item.

So right away, your speed is gone. What about size? Look again at our dog/Hund example. That is 8 characters long ( dog = 3, / = 1, Hund = 4). Putting the same data into an XML file might look like dogHund. That is 55 characters, which is an increase in size by almost 7!

When we attempted to use the data stored in this structure, what used to take a page a few seconds to load now took 5 minutes! So we lost both speed and size with this data structure.

These solutions all depend on using files for word lists. Some people do not like those, as they can easily be garbled if an edit goes wrong, or completely deleted with the touch of a wrong key.

So that is when we put them all directly into a database. A database can be made to act like a list because you are essentially still pulling in all the relative data and storing it again in an array. And you can store it as dual language words per entry, as described in the first method, or as single language entries with keys. Actually, when using a database, you will likely have at least one unique key for everything.

The main strength perhaps of a database is the flexible access. You can change any word without affecting the others, while with files, you are opening a file with all words, making a change, and saving it again, hopefully without affecting anything else.

Databases are also good for when you have many people making changes to the data. Having people making changes to files can be tricky and hazardous.

The downside is the overhead, since every entry needs extra data to define it, and you need to do a load on all the data and putting it into an array again. But in truth, the differences between all three methods (flat file, array file, database), is probably so small so as to not be noticed on most systems.

All pages

The Technical Secrets of LingoHut, Maybe

Writer:

Erik Zidowecki

Images:

Petey: Data tunnel (splash title); Pencil and pad; Lockers; Database; Microphone; Cafeteria wall

All images are Copyright - CC BY-SA (Creative Commons Share Alike) by their respective owners, except for Petey, which is Public Domain (PD) or unless otherwise noted.

Comments

comments powered by Disqus

In this issue:

Read PDF online here

Archives
Missed something? Find previous issues in the archives.
Issue 35 Issue 35 Letter From The Editor - A Call to Action • Mark Your Calendar • A User-Friendly Introduction to the Tuvaluan Language • An Indigenous Year • 13 Fascinating Facts about Marshallese • A History of the Language of the Roma • Interesting Facts About Vurës - An Indigenous Language of Vanuatu • In Focus - Indigenous Faces • The Indigenous Languages of the UK • In Others' Words - Emily McEwan • At the Cinema - Moana • Language Puzzles • Basic Guide to Nahuatl • At A Glance	Issue 34 Issue 34 Letter From The Editor - New Apps, Old Languages • How it All Started: PluraLing • Languages in Peril - Calabrian Greek • Teddy Talks - Toki Pona and Tok Pisin • In Focus - Paris, France • Book Look - Publio Aurelio - un investigatore nell'antica Roma (series) • At the Cinema - The Extraordinary Adventures of Adèle Blanc-Sec • Proverbs from the World - Tver Karelian • Language Puzzles • Basic Guide to Amharic • At A Glance	Issue 33 Issue 33 Letter From The Editor - Italian Women • News Brief • Mark Your Calendar • Lasciare Andare – - Learning Italian And The Art Of Letting Go • A New Sicilian Author on the Horizon • In Others' Words - Deborah Caruso • Celebrations - Giostra del Saracino • In Focus - Arezzo, Italy • Book Look - Acquisition of Word Formation Devices in First & Second Languages” (2017) • Book Look - Waking Isabella: Because beauty can't sleep forever • At the Cinema - Life is Beautiful • Proverbs from the World - Italian • Language Puzzles • Basic Guide to Sicilian • At A Glance
Issue 32 Issue 32 Letter From The Editor - Letter From The Editor • News Brief • Mark Your Calendar • Fiverr and Languages: - Finding Translators on the Mini Jobs Network • Why I Will Never Learn the IPA • Teddy Talks - Can You Speak Bahasa? • In Focus - Yellowstone National Park, USA • Proverbs from the World - Swedish • Language Puzzles • Basic Guide to Punjabi • At A Glance	Issue 31 Issue 31 Letter From The Editor - Up the Guru Path • News Brief • Mark Your Calendar • How to Learn Any Language - - Breaking Down the Language Guru Myth • The Ethics of Language Teaching • Teddy Talks - Language Meetups • In Focus - Mykonos, Greece • Proverbs from the World - Latvian • Language Puzzles • Basic Guide to Japanese • At A Glance	Issue 30 Issue 30 Letter From The Editor - Being Special • Building a Language Dream • Learning Multiple Languages with LingoHut • LingoHut - A Simple And Effective Language Learning Resource • The Beginning Of A Surprising But Promising Partnership - LingoHut and TeachMe.vn • The Opportunity to Succeed • Review - LingoHut • The Technical Secrets of LingoHut, Maybe • Language Puzzles • In Focus - Amsterdam, Netherlands • Proverbs from the World - Spanish • Basic Guide to Turkish • At A Glance
Issue 29 Issue 29 Letter From The Editor - Language Confusion • News Brief • Mark Your Calendar • Stepping Up Your Language • Montreal LangFest 2017: Another blowout success • How Can My Study Book Be Monolingual? • Boredom And Classroom Students - A Teacher's Perspective • In Others' Words - Malachi Rempen • In Focus - Venice, Italy • Proverbs from the World - Finnish • Language Puzzles • Where Are You? • Basic Guide to Dutch • At A Glance	Issue 28 Issue 28 Letter From The Editor - The Finer Points • News Brief • Mark Your Calendar • Language Perceptions • Language Learning in Resource Poor Languages • Why Fluent English Language is Important for Business • In Others' Words - Máirín F Millward • Keeping Up With The DLC • In Focus - Beijing, China • At the Cinema - Welcome Mr. President • Proverbs from the World - Chinese • Language Puzzles • Where Are You? • Book Look - Dreaming Sophia: Because Dreaming is an Art • Basic Guide to Finnish • At A Glance	Issue 27 Issue 27 Letter From The Editor - In This Issue • News Brief • Mark Your Calendar • More About Cognates Than You Ever Wanted to Know • A Peek into Pinyin • An Art Exhibition That Spoke To Me • The Learning Mindset • In Focus • At the Cinema - Krrish • Language Puzzles • Proverbs from the World - Dari (Afghan Persian) • Where Are You? • Basic Guide to French • At A Glance
Issue 26 Issue 26 Letter From The Editor - Nom de Plume • News Brief • Mark Your Calendar • Language Learning In The Globalization Era: - Translation, Culture And Power Relations • When Pigs Fly • Introducing Words R Us • Languages in Peril - The Good Language of Brazil • In Focus • At the Cinema - Un Sac de Billes • Language Puzzles • Where Are You? • Book Look - Aikainen lintu nappaa madon. Sananlaskuja läheltä ja kaukaa • Basic Guide to Croatian • At A Glance	Issue 25 Issue 25 Letter From The Editor - No Politics • Make Your Own Language Group • A History of Research in Study Abroad • Parrot Time on Patreon • Languages in Peril - Sayonara, Ainu • At the Cinema - La Coppia dei Campioni • Where Are You? • Book Look - The Bible of the Language Learners and Polyglots • Basic Guide to Romanian • At A Glance	Issue 24 Issue 24 Letter From The Editor - Saying Without Meaning • Introducing Southeast Asia in Taiwan • At the Cinema - Around the World in 80 Days • Where Are You? • Book Look - Bad Words Dictionary • Basic Guide to Welsh • At A Glance
Issue 23 Issue 23 Letter From The Editor - Hope and Failing • Six Ways To Choose Which Languages To Learn • Learning Spanish - The trials, the tribulations and one triumphant learning hack • At the Cinema - The Last King (Birkebeinerne) • Celebrations - Birkebeinerrennet • Where Are You? • Book Look - Langenscheidt Dictionaries • Basic Guide to Swedish • At A Glance	Issue 22 Issue 22 Letter From The Editor - Culture and Language, Again • Learning A Language Is Learning Its Culture • Revisited - Early Bardic Literature in Ireland • Languages in Peril - Save Medan Hokkien! • In Others' Words - Ulrike and Peter Rettig • At the Cinema - Monster Hunt • Where Are You? • Book Look - Language Alter Ego • Basic Guide to Italian • At A Glance	Issue 21 Issue 21 Letter From The Editor - A Kind Word • Language and Power: The Hidden Struggle • 4 Ways To Learn Through Reading • Language Learning is for everyone! • Languages in Peril - The Decline of Sicilian • At the Cinema - The Host • Where Are You? • Book Look - Italian Short Stories for Beginners • Basic Guide to Hungarian • At A Glance
Issue 20 Issue 20 Letter From The Editor - Double Speak • On Being Bilingual • Language Creation and Deities • A Medley of Virtual Languages • In Others' Words - Siskia Lagomarsino • At the Cinema - Dilwale Dulhania Le Jayenge • Where Are You? • Basic Guide to Polish • At A Glance	Issue 19 Issue 19 Letter From The Editor - Making it Happen • Motivation - Expressing oneself and the expression of oneself in language learning • Motivation Killers in Learning a Language • Mixing Languages and Relationships • In Others' Words - Brian Powers • At the Cinema - Cutting Room Floor • Languages in Peril - Cyprus' Language Revival Approach Problem • Where Are You? • At A Glance	Issue 18 Issue 18 Letter From The Editor - The Importance of Travel • Broadening The Mind Travels The World • The Secret Life of Diacritics • There Are No Wrong And Right Gestures, Only Cultural Differences • Google Translate Exposed: - The Truth Behind Everyone's Favorite Translator • At the Cinema - Queen • Book Look - The A to Z of Learning German • Where Are You? • Basic Guide to Papiamentu • At A Glance
Issue 17 Issue 17 Letter From The Editor - Free Things • The Cost of Free Language Resources • Review of Polyglot Workshops: Brazil • Easier Way to Learn Languages Fast • Dream, decide, do - tips from a polyglot • At the Cinema - Cambio de Ruta • Languages in Peril - Talysh • Where Are You? • App Rev - Tandem • Book Look - Language Master Key • At A Glance	Issue 16 Issue 16 Letter From The Editor - Studying in Summer • Polyglot Events All Around The World - You Are Not Alone • Playing Games with Language • Spanish E-training – The 'Big Bang' Investment • Can a Language Die? • At the Cinema - La Casa del Fin de los Tiempos • Languages in Peril - Scottish Gaelic • Words in Your Mouth - Apple • Celebrations - Nag Panchami • Where Are You? • Book Look - Fluency Made Achievable: The Fluent Guide to Core Language Skills • At A Glance	Issue 15 Issue 15 Letter From The Editor - Sounds Like • How Do You Say It? - A look at sound notation systems • Of Pidgins and Creoles - A look at how some languages are born • Who Are You To Learn A Language? • At the Cinema - Dil Chahta Hai • Languages in Peril - Yumans on the Edge • Words in Your Mouth - Egg • Where Are You? • Book Look
Issue 14 Issue 14 Letter From The Editor - Breaking with Tradition • Are You Wasting Your Money on Language Classes? • Chatting in Languages Online - Part 2: Voice Chats • Why English Is Different Than Any Other Language • The Digital Language Collective • At the Cinema - Viva La Libertà • Languages in Peril - The Tribes of the Tamil-Kannada • Words in Your Mouth - Rice • Where Are You? • Book Look	Issue 13 Issue 13 Letter From The Editor - Thirteen • Chatting in Languages Online - Part 1: Text Chats • Why Do People Learn Languages? • The Question Of Practice - An International Language Is Possible • At the Cinema - Chinese Puzzle • Celebrations - Fastelavn • Words in Your Mouth - Cheese • Where Are You? • Book Look	Issue 12 Issue 12 Letter From The Editor - Over Time • Which Language Is...? • The Ultimate Fate of Language Learning • 5 Funny Words In Afrikaans From My Perspective • At the Cinema - Everybody's Famous! • Word on the Streets - Why Writers are Important • Words in Your Mouth - Milk • Where Are You? • Book Look
Issue 11 Issue 11 Letter From The Editor - World Ambassadors • Coming Home to Faroese - The Why and How of Learning a Small Language • Danish and Faroese: A Biography • At the Cinema - Ludo • Basic Guide to Faroese • Celebrations - The Faroese Festival Summer • Revisted - The Faroe Islands • Word on the Streets - Famous Faroe Islanders • Where Are You? • The Grind: Why the Faroese Hunt Whales • The Legend of the Scottish Princess • Faroese Ballads - Nornagest Ríma and Ormurin Langi	Issue 10 Issue 10 Letter From The Editor - Expansion • Religion in Culture • Languages in Peril - Decline of the Gallo-Italics • Language Learning and Translation • Word on the Streets - Italian Greats • Book Look • At the Cinema - Xingu • Celebrations - Hangul Day • Where Are You? • Words in Your Mouth - Bread	Issue 9 Issue 9 Letter From The Editor - Tracing Words • Constructed Languages - Making It All Up • Language Conflicts - Flemish vs. Walloon • Rohonc Codex - Hungarian Enigma • At the Cinema - Il Comandante e la Cicogna - Garibaldi's Lovers • Where Are You? • Words in Your Mouth - Sausage • Book Look • GlobTech - Using Locale
Issue 8 Issue 8 Letter From The Editor - Globalization • Speaking with Aliens • Celebrations - Esala Perahera - The Festival of the Tooth • Language Conflicts - Bokmål vs. Nynorsk • At the Cinema - Pane e Tulipani - Bread and Tulips • Revisited - Words Which Have Changed Their Meaning • Languages in Peril - Keeping Up With The Kartvelians • Where Are You? • Sections - Reviews • Word on the Streets - Indonesian Innovators • GlobTech - Google Translate Section	Issue 7 Issue 7 Letter From The Editor - The Highlander Condition • When Languages Meet • At the Cinema - Mal Día Para Pescar - Bad Day to Go Fishing • Celebrations - Tanabata - The Star Festival • Languages in Peril - The Romanian Relatives • Revisited - Words Made By Great Writers • Where Are You? • Language Learning Methods - Immersion • Sections - Links	Issue 6 Issue 6 Letter From The Editor - Price of Fame • Liber Linteus - Mummified Language • Pencak Silat • At the Cinema - Bombay • Celebrations - Inti Raymi - Festival of the Sun • Cracking the Code • Languages in Peril - The Chibchan Family • Revisited - Words From The Names Of Animals • Word on the Streets - Great German Authors • Where Are You? • Language Learning Methods - Internet • Sections - Neighborhood
Issue 5 Issue 5 Letter From The Editor - Why Polynesian? • Rongorongo - Island Chants • Otto Dempwolff - Islands of Language • At the Cinema - Whale Rider • Celebrations - Pasifika Festival • Special Feature - Avoiuli • Languages in Peril - The Island Invasion • Revisited - Legends of Maui - Maui's Home • Word on the Streets - Malay Masters • Where Are You? • Revisited - Legends of Maui - Maui Snaring the Sun	Issue 4 Issue 4 Letter From The Editor - Linguist or Polyglot • The Phaistos Disc - Puzzle of Crete • Otto Jespersen - Progress of Language • At the Cinema - Kukushka - The Cuckoo • Celebrations - Carnival • Languages in Peril - The Salish Tragedy • Word on the Streets - Kannada Writers • Where Are You? • Revisited - Stories In The Names Of Places • New Souls • Language Learning Methods - Software • Sections - Parleremo YouTube	Issue 3 Issue 3 Letter From The Editor - Freaking Out • The Voynich Script - Cryptic Codex • Benjamin Whorf - Relativity of Language • At the Cinema - Lost in Translation • Languages in Peril - The Polish Connection • Word on the Streets - Romanian Poets • Where Are You? • Celebrations - Holi • A Language Dream • Revisited - Words From National Character • Language Learning Methods - Classes • Sections - Language Exchange
Issue 2 Issue 2 Letter From The Editor - Truth in Advertising • Linear A & Linear B - Lost Minoan • Edward Sapir - Patterns of Language • At the Cinema - Atanarjuat: The Fast Runner • Word on the Streets - Norwegian Notables • Where Are You? • Celebrations - Valentine's Day • Languages in Peril - The Rhaeto-Romance Trio • Revisited - Proverbs • Linguistics Love Song • Language Learning Methods - Books • Sections - Recordings	Issue 1 Issue 1 Letter From The Editor - A New Parrot Time • The Rosetta Stone - Triple Cypher • Ferdinand de Saussure - Signs of Language • At the Cinema - L'auberge Espagnole • Languages in Peril - The Finno-Ugrics • Word on the Streets - The Russian Zone • Where Are You? • Celebrations - Day of the Dead • Revisited - Slang • We Are The Linguists • Language Learning Methods - Audio • Sections - Journals

Puzzle books in over 30 languages

Make learning a language fun!

Supplement you learning with quiz and activity books

Home

About Us

Read PDFS Online

Main Contents
	Letter From The Editor - Being Special
	Building a Language Dream
	Learning Multiple Languages with LingoHut
	LingoHut - A Simple And Effective Language Learning Resource
	The Beginning Of A Surprising But Promising Partnership - LingoHut and TeachMe.vn
	The Opportunity to Succeed
	Review - LingoHut
	The Technical Secrets of LingoHut, Maybe
	Language Puzzles
	In Focus - Amsterdam, Netherlands
	Proverbs from the World - Spanish
	Basic Guide to Turkish
	At A Glance
Credits

Parrot Time Magazine

The Technical Secrets of LingoHut, Maybe