Blog

Special guest on episode of “Dope Labs” podcast

I recently had the honor of being the guest expert on the latest episode of the Dope Labs podcast.

The episode is called “Signed, Sealed, Delivered” — I talked to the hosts, Titi and Zakiya, about all sorts of aspects of written language. The link below will take you to their description of the episode and some show notes; you can listen to the podcast on Spotify, iTunes, etc.

“Signed, Sealed, Delivered” – Dope Labs

ABOUT THIS EPISODE: “This episode came to life as we followed conversation building up on our timelines over the past few months. People are photographing themselves buying stamps, sending packages, and one of our favorites – mailing letters. We started to think more about it – the handwritten letter just feels so elevated. It conveys importance, formality, intentionality. It’s a lot different when you receive a text message compared to a postcard or letter, and we started asking ourselves why.”

Why do letters look the way they do?

What do the shapes on the left all have in common? What about those on the right? (Pictured, from top: Roman, Greek, Hebrew, Arabic, and Devanagri scripts.)

When you look at this image, do you know what all of the shapes on the left have in common? How about the shapes on the right?

If you know how to read more than one of those languages, then you may think the answer is that they represent the same sound, or have the same name. While that’s not a bad answer, what would you think if I told you that what all of the shapes on the left have in common is that they all came from a drawing of an ox? And on the right, a drawing of a house?

The shapes above on the left originally were a representation of an ox, and those on the right a house! Can you see the resemblance?

This is true because of how many of the world’s writing systems evolved—through a process where the earliest written symbols were pictographic or ideographic. That basically means that the most ancient forms of writing represented an idea by drawing an image to represent it… the shape of an ox’s head to represent “ox”, or the outline of a house to represent “house”.

Although the details are lost to history, researchers do know that over time these shapes became abstracted away from their origins as obviously visual representations. Instead of a symbol representing a whole word or an idea, they started to represent the smallest units of human language: “phonemes”, the basic pieces of our words. It is this evolution that allowed a drawing of an ox, which was called “alep” or “alef” in ancient Semitic languages, to represent just the first sound of that word—and why the language you are reading right now calls this letter “A”!

A drawing of an ox’s head (“alep”) becomes more abstract, and stands in for just the first sound–“a”. If you turn this familiar letter, “A”, upside down, you can still see the ox’s head.

Of course, this evolution of what written symbols represent does not explain why our shapes look the way they do—while it’s pretty easy to see how an “A” looks like an ox, it’s not so obvious for other letters. And indeed, some letters (or entire writing systems, like Korean Hangul) never began as depictions of real objects. While it’s hard to say with certainty why the shapes of our letters are what they are, some fascinating research suggests a couple of probable factors that drove our distant ancestors to create the shapes that they did. 

In Mesopotamia, the cuneiform writing system was produced using a stylus on clay tables (top), whereas in Egypt hieroglyphs were carved into stone or inked onto papyrus (middle), and in China free-flowing forms developed with the use of brushes (bottom). Notice that early shapes (top rows) look like pictures of the actual objects, but with time (lower rows) they became more abstract–in both appearance and meaning.

One of those historical factors is the medium in which the language was written. If you’ve ever tried to sign your name on a painting using a brush, or chisel it into a sculpture, you know that the tools you use have a major impact on what you are able to create. Cultures that prominently used clay or stone to write were likely influenced by those media in a way that lead to very different types of shapes than those that used ink and brushes—and the advent of the printing press or, in more recent times, computers, lead also to fundamental changes in typography.

Intersecting lines and curves are all around us. Can you see the T’s, L’s, V’s, Y’s, etc., all over this scene?

Another possible factor that may explain the shapes of our letters comes from nature. Some researchers have argued that the prevalence of certain shapes across different languages, like L-shapes and T-shapes, are a reflection of what shapes we encounter naturally in our environment (Changizi et al., 2006). Think about how a tree trunk meets the ground to form an upside-down T-shape, or how the shoreline of a lake creates a C-shape. It turns out that such intersections which naturally occur more often are also more common in writing systems across the world and history (so for example, shapes like L are very common, but shapes like A are less so).

Watch a short clip of Hubel and Wiesel’s classic experiment.

One last clue as to why our shapes look the way they do comes from what we know about brain activity in response to viewing letter-shapes like this. In visual cortex (the occipital lobe), there are neurons that respond preferentially to certain types of stimulation, which we call “visual features”. Pioneering experiments by Hubel & Wiesel in the 1960’s helped pave the way for understanding this basic phenomenon. There are groups of neurons that become particularly excited when they see lines of certain orientations, for example responding much more to a line at 45° then a line at 0° (horizontal); other neurons may prefer lines that intersect each other like in an X; still others, to closed circles like O as opposed to open ones like C. This property of visual cortex most have evolved not for the purposes of reading, but rather for recognition of objects and scenes—and it is consistent with the possibility that our letters tend to use the same shapes as those we see frequently in our environment!

References/Further reading:

• Changizi, M. A., Zhang, Q., Ye, H., & Shimojo, S. (2006). The structures of letters and symbols throughout human history are selected to match those found in objects in natural scenes. The American Naturalist167(5), E117–E139.

• Hubel, D. H., & Wiesel, T. N. (1968). Receptive fields and functional architecture of monkey striate cortex. The Journal of Physiology195(1), 215–243.

• Daniels, P. T., & Bright, W. (Eds.). (1996). The world’s writing systems. Oxford University Press on Demand.

Welcome to “The Writing Brain”

“Writing — the art of communicating thoughts to the mind, through the eye — is the great invention of the world. Great in the astonishing range of analysis and combination which necessarily underlies the most crude and general conception of it — great, very great in enabling us to converse with the dead, the absent, and the unborn, at all distances of time and of space; and great, not only in its direct benefits, but greatest help, to all other inventions. ” — Abraham Lincoln

 

It is difficult to imagine a world without written language. Although modern technologies do allow for communication across “all distances of time and of space” without written language (for example, cell phones), it surely would have been impossible for humans to develop such capabilities without written language. And despite, or perhaps because of, the ubiquitous nature of digital technology, written language is everywhere around us. And yet, anyone who can read this website probably takes their ability to do so for granted– once we learn how to read, it seems to come almost as natural as breathing.

But many of us may have found learning to read and write difficult as children; we certainly all know someone who had or continues to face such difficulties, and as teachers or parents are reminded of how this skill requires extensive practice. When someone says “I know how to read”, what is it she actually knows? How are we able to transform ink markings on a page (or black pixels on a screen!) into meaningful language? How can we best teach this skill? How is written language being changed by modern technologies? How do written languages differ around the world?

Welcome to “The Writing Brain”! These are the types of topics that will be covered on this website, through a mix of original blog posts and summaries of new research conducted across the range of fields that constitute Cognitive Science (including psychology, neuroscience, linguistics, and philosophy), with a focus on communicating the research to educators, in particular, and interested audiences in general.

Guest expert on NPR: “Forum” on KQED

I recently had the pleasure of being a guest expert on “Forum”, hosted by Mina Kim on San Francisco’s NPR station KQED. You can listen to the recording of the broadcast right here.

The episode was inspired by this article in The Atlantic written by former Harvard University President Drew Gilpin Faust, who was also a guest on the show. We were also joined by distinguished Emeritus Professor Virginia Berlinger of U Washington and Sandra Gutierrez, associate DIY Editor of Popular Science (who write this piece here).

In it, I discuss some of the science of handwriting and the brain–why experience with handwriting may be beneficial not just for your penmanship, but for your ability to spell, read, and, as suggested by some of the other guests and public who called in, even for your ability to connect meaningfully with the past.

What is dyslexia? What causes it?

 

Dyslexia is surely one of the most recognized learning disabilities, yet also one of the most commonly misunderstood. One difficulty is simply that the term is used in different ways by different groups: teachers, education specialists, and psychologists will likely all provide different definitions of dyslexia.

 

In common language, ‘dyslexia’ usually refers to difficulties with reading, but very often challenges with spelling are also present (although the term for a problem specifically with spelling is dysgraphia). And unless otherwise specified, dyslexia usually refers to a developmental dyslexia, meaning a reading/spelling deficit appearing in children as they mature. This is opposed to acquired dyslexia, meaning the loss of reading ability after a brain injury, such as a stroke (for more on acquired dyslexia and dysgraphia, read here).

 

ramus2019img
An illustration of the core phonological deficit in the brains of dyslexic children (Ramus, 2014; accessed from here on July 30, 2019). In yellow are the parts of the brain (left superior temporal lobe) sensitive to phonology in both dyslexic and typical readers; in the green circle is the part of the brain (left inferior frontal lobe) known as Broca’s area, a key area for higher-level processing of speech. A leading theory maintains that it is the connection between these two areas that is primarily responsible for dyslexia, and that genetic factors determine the strength of this connection.

There are a lot of myths and misconceptions about dyslexia (do a quick web search for “dyslexia myths” and you’ll find lots of great information debunking them), one of which I’ve written about previously (see here). What it all boils down to is that dyslexia is a difficulty with mapping between the written word and its sound and meaning—but this difficulty can arise in different ways, depending on the individual and the language being read.

 

Dyslexia rates are highest in languages that use what is known as “opaque orthography”, like English, and lowest in languages that have transparent orthography. What’s the difference? English has a very opaque orthography—think of words like “ghost” (what is that ‘h’ doing there?) and “island” (not ice-land or is-land!). Most English letters can be pronounced multiple ways, sometimes even within the same word (the ‘c’s in “circle”). Even worse, most English sounds can be spelled multiple ways (like the vowel sound in “wait”, “late”, and “freight”… for more on why English spelling is so weird, see this post here). Other languages, like Spanish and Italian, have much less complicated relationships between letters and their sounds. So for example, at the end of first grade, children learning to read English make more errors when reading (67% in the United Kingdom) than those learning French (28% in France) or Spanish (8%; Dehaene, 2009).

 

Probably the main reason dyslexia is more diagnosed in English-speaking children is because the “core deficit” of dyslexia is actually one related to phonology: a disability with learning how to map between letters and their sounds (see Ramus, 2014). This means that for most children, the best way to overcome dyslexia will be educational interventions that focus specifically on phonology and how to ‘sound out’ words. Moreover, this seems to be largely genetically determined.  While there may be associated deficits in areas besides phonology, the underlying source of difficulty for these children is in mapping between letters and sounds.

 

However, there are some children whose reading difficulties are not related to phonology, but instead to deficits in memory, attention, or vision (although this issue remains controversial). Only an educator or psychologist with special training can distinguish between different underlying cognitive deficits, as it requires comparing and contrasting performance on carefully chosen tasks. It’s also worth mentioning that for acquired dyslexia, there are clearly defined sub-types, some of which have nothing at all to do with phonology.

 

In summary:

  • Dyslexia is a difficulty with relating the written word on the page (or a screen!) to sound and meaning
  • Dyslexia in children, known as developmental dyslexia, is primarily (although not exclusively) caused by a deficit in phonology, determined by genetic factors
  • Dyslexia is not a sign or symptom of low intelligence; it is not a visual problem with seeing letters correctly or controlling eye movements

 

 

 

Suggested links:https://dyslexiaida.org

References:

•Dehaene, S. (2009). Reading in the brain: The new science of how we read. Penguin.

• Ramus, F. (2014). Neuroimaging sheds new light on the phonological deficit in dyslexia. Trends in Cognitive Sciences, 18(6), 274–275.

Writing or Reading Letters Backwards Is Not a Sign of Dyslexia

Perhaps one of the most persistent misunderstandings of dyslexia is that one of its symptoms is perceiving letters as jumbled—especially as mirror-reversed, making letters like b/d and p/q especially challenging. However, the truth is that this is a very common thing for children to do when learning to read and write. With respect to dyslexia, its most common causes are unrelated to this mirror-reversal effect, or indeed to any difficulties with perceiving or producing single letters.

Screen Shot 2019-07-24 at 2.14.14 PM
The Roman alphabet is full of shapes that represent a different letter when reversed or rotated. This is a relatively unique property of our alphabet—most writing systems around the world have few shapes that are easily confused in this way, if any at all. Rates of dyslexia are high in English, especially compared to languages written in other scripts, but this has nothing to do with the shapes of the letters.

 

So, why do so many children write letters backwards? For adults for whom reading and writing is automatic and effortless, it is understandable that this seems like a funny, obvious mistake. But it turns out that the underyling issue is a difficulty with distinguishing between different perspectives of the same object. In a well-known anecdote illustrating this phenomenon (see Dehaene, 2009), it has been pointed out that  “a tiger is equally threatening when seen or left or right profile” (Rollenhagen & Olson, 2000)—meaning, it’s often helpful to immediately recognize something regardless of which direction it is facing! Thus, the theory is that vision has evolved so that, by default, we perceive shapes like b and d or as the same object. This surely is beneficial in most circumstances, with letters being a notable exception (challenge: can you even think of other shapes or objects that have different names depending on whether they are facing left or right?).

 

Maybe one of the most interesting aspects of this is evidence suggesting that learning to read and write a language like English helps us learn to tell apart left and right. For example, illterate adults are slower to detect a visual difference between shapes like p versus q, as are fluent readers of languages that do not have such shapes that are reversals of each other (see Pegado et al., 2014; Kolinsky & Fernandes, 2014; Danziger & Pederson, 1998).

arabicbackwards
In a recently conducted study (Wiley, 2018), even adults (perfectly literate in English) produced mirror-reversed shapes when being taught letters of the Arabic alphabet. In red are the correct orientations for the letters.

 

If you notice a child writing letters backwards, the good news is that this is common—in fact, most children will experience this a stage while becoming literate, and will cease to make such mistakes by around age 11. Of course, this leaves open the question of what are signs of dyslexia, and what might be its underlying cause or causes–you can read more about that here.

To double space or not to double space?

Do you put two spaces after each sentence? Feel free to comment below! I never have– but maybe I should? Read on to find out…

 

An article recently appeared in the Washington Post with the title “One space between each sentence, they said. Science just proved them wrong.” It was prompted by a recent study, and is promoted (with this click-baitish title) as resolving one of those age-old debates: should there be two spaces, or a single space, in between sentences?

The TL;DR version? The story goes that historically, for legibility purposes, two spaces were always printed in between sentences. Like I’m now doing in this paragraph (although spoiler alert: I don’t normally double space, and don’t think it’s important!). Then along came the computer, creating an explosion of fonts and styles of typography (a topic for another day, but see some of the further reading below!). It was argued that while old fonts, which were all monospaced due to the constraints of the typewriter and printing presses (meaning, each letter had the same width), may have benefited from the double spacing, the new fonts created by computers were not constrained in this way. And it’s certainly true that most of our commonly used fonts nowadays are not monospaced (the most common exception is probably Courier).

People have strong feelings about this issue! Check out a post about it on this other cool blog: https://www.cultofpedagogy.com/two-spaces-after-period/
People have strong feelings about this issue! Check out a post about it on this other cool blog: https://www.cultofpedagogy.com/two-spaces-after-period/

The new study touted in the Wasingtion Post article (Are two spaces better than one? The effect of spacing following periods and commas during reading) reports that, using eye-tracking methods, it was found that indeed double spacing is better (read more about eye tracking here)! The reporting on this study is a good example of misleading science communication, however– a bit closer look at what the authors actually found, and you’ll see that you shouldn’t start double-spacing if you haven’t been! For those of you who do, the good news is, you may as well keep on doing it.

The best reason to not change your typing habits, based on this study? Probably none of us should ever change our behavior because of the results of one study! This is a basic, but crucial point. If you’ve ever read more than one article on a new finding about what you should or shouldn’t eat or drink, you know this frustration– science works best as compiling results, from different studies, fields, methods, researchers, and labs. So, if all you care about is whether this study has unequivocal evidence to start or stop double-spacing, you can stop reading here: it definitely does not.

So what did they find? This was the first study to directly examine whether single or double spacing is better, and as I mentioned, they used eye-tracking methods, so you can really get a lot of detailed information about how well people were reading. Are you surprised no one studied this before? Well in fact, there are dozens of studies that have looked at spacing, and that used eye-tracking– but they all looked either a t t h e s p a c i n g between letters, or the spacing between words– not just the spacing between sentences.

What did those studies might find, you ask? An amazingly mixed set of results! And what is one of the take-home messages? Two things are really important for determining how spacing will affect you: (1) whether you’re familiar with the spacing (so, you may read worse just because it unusual for you to read that way)!, and (2) which fonts were used in the study.

proportional_monospace_font_widths

And sure enough, this new study on double-spacing had results that relate to both of those matters. First, remember that claim that double-spacing was better for monospaced fonts? Well, this study did indeed use a monospaced font (Courier!) and did not test any others. And secondly, and perhaps most crucially, they found a difference between people who normally single-spaced and those who normally double-space. While there was no effect on anyone’s comprehension of the sentences or their overall reading times, people who normally type with double-spacing (which was 21/60 people in their sample), did read more words-per-minute (WPM) when they read sentences with double-spacing, compared to single spacing. For those who normally type with single-spacing, they actually read a little bit fewer WPM in double-spaced paragraphs!

The upshot for the researchers was that they found all participants, whether or not they’re used to double-spacing, basically spent less time looking at the areas of the sentences around the periods, if there were two spaces. But again: this didn’t lead to better comprehension, or even to less time spent reading overall. So while it does seem that, at least for this one study and this one font, everyone may benefit from double-spacing, the benefit is limited to a short-term one: your eyes flow more evenly past the period and onto the next sentence, but this benefit doesn’t add up to anything in the context of the entire paragraph.

So as you can tell, I’m sticking to my single spaces!

Further reading:
The History of Typography – Britannica online

The Evolution of Typography

What makes an ‘A’ an ‘a’?

How are we able to identify what an object is, just by looking at it? If you were to describe to a young child what the letter ‘A’ looks like, you might talk about it having three lines like a triangle, two of them leaning toward each other, etc. But do you think you could come up with a single description that explains why all of the shapes in the figure above are an uppercase ‘A’? Surely not—yet nonetheless, we can seemingly tell without effort that we are looking at a chart of the uppercase letter ‘A’, and not some other letter or shape. How do we do this?

Arguably the most straight-forward way would be for us to simply memorize each instance of the letter ‘A’ that we have seen, and “label” in our memory, so to speak, as examples of that letter. But consider all of the different ‘A’s we have seen—different fonts, different sizes, styles, different people’s handwriting—we would have to memorize thousands upon thousands of shapes for each letter! This burden on our memory would be inefficient to the point of absurdity, and this theory of letter recognition (called “template matching”) has largely become discounted.

Roman letter features
Fig. 2 taken from Fiset, D., Blais, C., Ethier-Majcher, C., Arguin, M., Bub, D. N., & Gosselin, F. (2008). Features for uppercase and lowercase letter identification. Psychol. Sci.

The best research on this question generally agrees with a theory called “feature matching”, which proposes that instead we memorize a (much smaller) set of features that describe the letter’s shape. This means we identify a letter by determining what its visual features are: lines, curves, intersections, etc., and comparing those features to lists in our memory. For example, when you read this letter ‘A’, visual processes in your brain determine that it is composed of two diagonal lines that meet in an L-intersection, a horizontal line that intersects them in two T-intersections, etc. The idea that our brains respond to such basic features is iwell-supported by scientific research (and arguably goes back to the cat experiments of Hubel & Weisel— check that out below!).

[youtube https://www.youtube.com/watch?v=IOHayh06LJ4?rel=0&w=560&h=315]

The upshot of feature matching is that a lot of the variability we see in fonts, size, etc., don’t matter—think of all of the ‘A’s in that figure that have in common those two diagonal lines and a horizontal line. Of course, some shapes that we consider to be the letter ‘A’ look quite different, and so it must be the case that we memorize a separate set of features to identify those ones—most obviously, the lowercase ‘a’ has very different features than the uppercase ‘A’ (two different shapes that share the same letter identity are called “allographs”; look out for Part 2 of this blog entry for more on that topic!).

The downside of feature matching is that it is not obvious what the features we actually use are—and so researchers remained challenged with determining this. One thing we do know is that expert readers, like you as you read this blog, pay attention to different features than do people who don’t have expertise (e.g., children, or second-language learners; see references). The most likely reason for this might be that experts have a better sense of what is or isn’t important—and when it comes to reading, what is most important is distinguishing between different letters. This means we become better are ignoring irrelevant features that don’t change the letter’s identity (think about some of the ‘A’s in the figure that have a lot of extra flourishes—you know that they’re not important, but someone just learning to read might not!).

Suggested reading: Hofstadter, D. (1995). On seeing A’s and As. Stanford Humanities Review, 4(2), 109–121. https://web.stanford.edu/group/SHR/4-2/text/hofstadter.html

References:
Wiley, R. W., Wilson, C., & Rapp, B. (2016). The Effects of Alphabet and Expertise on Letter Perception. Journal of Experimental Psychology: Human Perception and Performance, 42(8), 1186–1203. https://doi.org/10.1037/xhp0000213

Palmer, S. E. (1999). Vision science: Photons to phenomenology (Vol. 1). Cambridge, MA: MIT Press.

Gibson, E. J. (1969). Principles of perceptual learning and development. East Norwalk, CT: Appleton-Century-Crofts.

Fiset, D., Blais, C., Ethier-Majcher, C., Arguin, M., Bub, D. N., & Gosselin, F. (2008). Features for uppercase and lowercase letter identification. Psychol. Sci.

Courrieu, P., Farioli, F., & Grainger, J. (2004). Inverse discrimination time as a perceptual distance for alphabetic characters. Visual Cognition, 11(7), 901–919.

Is it true that we don’t look at every word when we’re reading? What are our eyes doing when we read?

Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn’t mttaer in waht oredr the ltteers in a wrod are, the olny iprmoetnt tihng is taht the frist and lsat ltteer be at the rghit pclae. The rset can be a toatl mses and you can sitll raed it wouthit porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe.”

 

You’ve probably encountered that paragraph before (or, if you’re like me, a bunch of times—it seems to go viral every other year!). How true is this claim that “the human mind does not read every letter by itself, but the word as a whole”? The first part has some truth to it—we do not focus on each letter in a word as we read—but the second part is very misleading! It is a pernicious myth that we learn to read by memorizing words as a whole shape. And it’s easy enough to come up with examples where jumbling letters in this way is a real problem—calm becomes clam, blow becomes bowl, etc. So, what do we actually need to look at, when we’re reading?

 

As you’re reading this sentence, you might feel that your eyes are moving smoothly across it. In fact, when we read text, whether its on a printed page or a computer screen, our eyes more in a series of short jumps, called saccades. These saccades are very fast, around 20-35 milliseconds, and in between them our eyes fixate on the text. These fixations can be brief (150 milliseconds), or relatively long, say one half of one second.

 

So, what is it that we look at during these periods of fixation? It is true that we do not focus on every single word when we’re reading—this is more or less for two reasons. First, we’re able to perceive several letters within the fovea (the center of our gaze): in languages like English, which are written from left to right, we can see a few letters to the left of our fixation and maybe 12-15 to the right (in languages written from right to left, like Arabic and Hebrew, readers can perceive more letters toward the left of fixation that the right!). This means that during each fixation, we take in a few words at a time, unless there are very long words. When we saccade to our next fixation, we are able to skip over some words because we actually have already seen them. This means, of course, that one of the challenges of reading is remembering the words and letters you have recently seen (in working memory) and integrating them with new information, as you continue to saccade through the sentence.

 
visible_light_eye-tracking_algorithm

The second reason we do not need to fixate on every word is because we are often able to predict what words are going to follow—and we can use this ability to predict to speed our reading. This is often true of function words (words like “to”, “the”, and “do”), but also in sentences where the context leads to a very high probability for a certain word. Imagine that in one fixation you read “They sang Happy…”—you can guess that almost definitely the next word is “Birthday” (in fact, when we read sentences where we expect one word and it ends up being another, this surprise has consequences—it will cause us to slow down dramatically in our reading speed and often to double back and re-read!).

 

How do we know these things about reading? Mostly through the use of a machine called an eye tracker, which allows us to know (with very high temporal precision) where someone is looking. There are many videos online where you can see demonstrations of an eye tracker at work. This one in particular “How we read shown through eye tracking”) shows how we move our eyes from one line of text to the next—and how this is affected by the way that the lines are (or are not) justified!

 

Links:

“Eye movements in skilled readers”

“What eye movements during reading reveal about processing speed”

“How we read shown through eye tracking”

 

 

Why is English spelling so strange?

You may have had this experience before: you look at a word, and think to yourself “That can’t be how it’s spelled… is that right?” … that first R in February just looks wrong, sometimes. You may also have found yourself at some point asking your English teacher why something is spelled the way it is. If you got more than a response of “Because it just is.”– you’re lucky! Few people study orthography (the part of language concerned with letters and spelling), and fewer stills its history. However, we actually can find a lot of information about why words are spelled the way they are, by learning about their etymology—the chronology of a word’s form and meaning from its earliest (recorded) sources to current usage and spelling. It is a bit of a myth that English spelling is so strange because it is entirely random and arbitrary

The English language is a famous borrower—with roughly 200,000 words in the English lexicon (vocabulary), tens of thousands of words have been borrowed from several other languages, in particular Latin and French, which together account for half or more of the sources of modern English words. Other words are derived from Germanic languages, Greek, Spanish, Italian, Arabic, Chinese, you name it—and as a consequence of this global borrowing, there is added diversity in the spellings of words. This is because English orthography is relatively opaque: the relationship between letters and the sounds they represent is far from a perfect 1-to-1 mapping. For example, ‘K’ usually represents the phonological form (sound) /k/, a hard sound like in “kid”—but sometimes it’s read as /n/ like in “knife”, where it is part of the digraph (two letters that represent a single unit) ‘KN’. The letter ‘C’ is especially challenging—it can also represent the hard /k/, like the second ‘C’ in “circle”, but also the sound /s/, like the first ‘C’ in that same word! Or it can be part of the digraph ‘CH’, which can represent the sound /tS/ like in “choose”. And of course, there are those dreaded silent letters, like the ‘C’ in “indict”!

So, why is English spelling so irregular? It turns out that there is some method to the madness, which is beautifully described by D.W. Cummings in his book “American English Spelling”. The gist of it is, our spelling system as a whole is the result of a balance between competing ideas of what a words spelling should convey. You might think that the letters we use to spell a word should just tell us its sound—this is the phonetic demand. What would happen if we took this demand to the extreme, however? Should we spell the same word differently depending on the accent of the speaker—should “car” be spelled k-a-r for some of us but k-a-h for Bostonians? Should we spell homophones, words that have the same sound but different meanings, the same way—“two to many to count” becomes “too too many too count”? And how about the ‘S’ at the end of “cats” versus “dogs”—the first is pronounced as an /s/ but the second as a /z/ (say it aloud to yourself if you don’t believe me!)—should we write “It’s raining kats and dogz!”? A purely phonetic system, then, becomes problematic, because some sounds have multiple meanings, and because some words have multiple acceptable pronounciations.

MYqnbKqW6m-4

Much of the oddness in our word spellings, then, comes from a desire to use the orthography to disambiguate meaning, or to give clues to the origin of the word (and, ideally, its definition). We want to spell “dogs” with an ‘S’ so that we consistently write our plural marker with that letter. For another example, although the ‘G’ in “sign” is not pronounced, it allows us to see its relationship with other words like “signature” (and not confuse it with “sine”!).

Admittedly, the system could be improved. Some of the oddities are no longer practically useful. Unless you speak Latin, the silent ‘C’ in “indict” or the ‘B’ in “debt” are not very helpful (they’re re-introduced nods to the original Latin “indictare” and “debitum”). In fact, several letters of the alphabet could probably be done away with entirely, if we were only willing to let go holdovers from their sources (for example, ‘X’ could be replaced with “ks” or “gz”). Ultimately, changes in spelling, like changes in language in general, occur continuously and naturally through social processes—what’s “here” today may be “gon tomarow”.

 

Links:

http://www.dwcummings.com “A Site for spellers, teachers of spelling and reading, and students of English words”

https://www.etymonline.com Online etymological dictionary

https://en.wikipedia.org/wiki/English-language_spelling_reform

Further reading: Chapter 1 in Cummings, D.W. (1988). American English Spelling. JHU Press: Baltimore.