Read through this Wikipedia article carefully.
Discuss the following questions with your partner:
As any user of an electronic calculator knows, the ten Arabic digits can be represented as an array of seven features - three horizontal lines and four vertical lines.
This gives us an idea for encoding the input layer for a letter/word recognition neural network - we have seven input perceptrons, each of which is either on or off.
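For concreteness, here is a minimal sketch (in Python, using an arbitrary segment ordering of my own choosing) of how a few digits could be encoded as seven on/off input values:

# Segment order (an assumption, for illustration only): top, top-left, top-right,
# middle, bottom-left, bottom-right, bottom. 1 = segment on, 0 = segment off.
DIGIT_FEATURES = {
    "0": [1, 1, 1, 0, 1, 1, 1],
    "1": [0, 0, 1, 0, 0, 1, 0],
    "7": [1, 0, 1, 0, 0, 1, 0],
    "8": [1, 1, 1, 1, 1, 1, 1],
}
# Each list is the activation pattern of the seven input perceptrons.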
Is it possible to represent every letter in the English alphabet using these seven feature detectors?
You may want to check out this page for inspiration. Or maybe this one.
The seven-segment display is OK, but not particularly good at representing English letters. Can you come up with something better? How many feature detectors (or input perceptrons) do you need?
The basic letter recognition task involves taking a visual image as input and deciding which letter of the English alphabet the input best corresponds to.
Based on the set of feature detectors you came up with in the previous question, and what you know about simple, two-layer, feed-forward pattern associator neural networks, sketch out a quick plan for building and training a system that can do this task.
Write down a couple of examples of "dirty" input to the system. What should your trained neural network output for these cases?
The output layer of this pattern associator has a serious deficiency for the letter recognition task - think about how many output perceptrons we want to fire at any one time. What is the problem? How might we go about using lateral inhibition to solve it?
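One rough way to picture lateral inhibition at the output layer (the constants are made up): every letter unit inhibits every other letter unit, so after a few settling steps only the most strongly activated unit survives.

import numpy as np

def lateral_inhibition(activations, inhibition=0.2, steps=10):
    """Repeatedly push each unit down in proportion to the total activity of its rivals."""
    a = np.asarray(activations, dtype=float).copy()
    for _ in range(steps):
        rivals = a.sum() - a                          # combined activity of all the other units
        a = np.clip(a - inhibition * rivals, 0.0, 1.0)
    return a

# e.g. lateral_inhibition([0.9, 0.6, 0.3]) ends up with only the first unit still active.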
Think about the following questions:
Run the following commands in your UNIX terminal:
[verve]mmcconvi: python
>>> import nltk
>>> from nltk.corpus import brown
>>> flws = [w.lower() for w in brown.words() if len(w)==4 and w.isalpha()]
>>> flws
>>> len(set(flws))
>>> f = nltk.FreqDist(flws)
>>> flws2 = [w for (w,c) in f.items() if c>1]
>>> len(set(flws2))
What does this tell you?
The word recognition task involves taking a sequence of four letters as input and deciding which valid English four-letter word the input best corresponds to.
Based on the output layer you came up with in the previous question, and what you know about simple, two-layer, feed-forward pattern associator neural networks, sketch out a quick plan for building and training a system that can do this task. Would lateral inhibition help?
Write down a couple of examples of "dirty" input to this system. What should your trained neural network output for these cases?
Now we are interested in building a system that combines the previous two networks you designed. The input to this composite system will be a visual representation of a four-letter word (encoded in terms of letter features) and the output will be the proper English word it is most likely to represent.
Sketch out a combined three-layer network that incorporates both of the networks you defined in the previous two questions in an intuitive way.
How many input perceptrons will you need? What will they each represent? How many on each of the other two layers?
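As a sanity check on those counts, here is one possible tally in Python. The 14 feature detectors per letter position are an assumption standing in for whatever set you designed earlier, and the word count would come from the four-letter word list extracted with NLTK above.

N_FEATURES_PER_LETTER = 14      # assumption: size of your feature-detector set
N_POSITIONS = 4
N_LETTERS = 26

input_units = N_POSITIONS * N_FEATURES_PER_LETTER   # one feature unit per position, e.g. 56
letter_units = N_POSITIONS * N_LETTERS               # 104 letter units (26 per position)
# word_units: one per valid four-letter word, e.g. len(set(flws2)) from the NLTK session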
The word superiority effect (WSE) suggests that the letter recognition task is influenced by context. For example, if the letters in positions 2-4 are HIP, what does this tell us about the letter in position 1? And if the letter in position 1 is a Q, what does this tell us about the letters in positions 2 and 3?
How might you go about improving your combined network to account for the context effect in letter recognition? Think about what extra connections you could add, and whether they should be excitatory or inhibitory.
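One very rough cartoon of the idea (not the actual model discussed below): add top-down connections from each word unit back to the letter layer, excitatory towards the letters the word contains and inhibitory towards everything else in that position. The constants and array shapes here are assumptions.

import numpy as np

def topdown_step(letter_acts, word_acts, word_letters, excite=0.1, inhibit=0.1):
    """One step of top-down feedback from word units to letter units.

    letter_acts:  array of shape (4, 26), activation of each letter at each position
    word_acts:    array of shape (n_words,), activation of each word unit
    word_letters: array of shape (n_words, 4), index of each word's letter at each position
    """
    new_acts = letter_acts.copy()
    for w, act in enumerate(word_acts):
        for pos in range(4):
            target = word_letters[w, pos]
            new_acts[pos, :] -= inhibit * act                   # inhibit all letters in this slot...
            new_acts[pos, target] += (inhibit + excite) * act   # ...then excite the consistent one
    return np.clip(new_acts, 0.0, 1.0)

Run alongside the bottom-up pass, an active CHIP or SHIP unit would then boost the corresponding letter in position 1 even when the visual evidence for it is weak, which is the kind of context effect the WSE describes.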
The relevant model is discussed here. You don't have time to read this just now, but I recommend you come back to it at your leisure. What you should do just now is examine Figures 2, 3 and 4 carefully and compare them with what you came up with. Pay particular attention to the distinction between connections that end in little black dots and those that end in little black triangles.
There is an online simulation of the model here. Read the instructions carefully and play about with it for a bit. Some examples of input sequences to try out are "chip", "ghip", "work", "wor*", etc. Make sure you understand what the output is telling you. You can look at the output words, or just at the output letters in particular positions.
Another example of an interactive activation model involves the Jets and Sharks gangs. You can see a visual simulation of this model here. And you can read about it in Ex 2.1 of the PDP manual. Or here.
Play with the visual simulation, hovering over each of the perceptrons to see what happens (Don't click on them yet, just hover!). Think about the following:
The Science of Word Recognition by Kevin Larson.