Skip to main content

The Science of Seeing


The Science of Seeing
Our goal in this series of lessons is to try and understand how you use your visual sense to understand the world around you. I can’t do better than David Marr, who said that vision was really all about knowing “what was where by looking.” This is what we want to figure out: How do you do that? How does your vision work?

This may seem like an odd question to ask. Your vision feels effortless. Look around you and there’s the world: People you can identify, objects you can recognize, colors, surfaces, textures, all of which you quickly grok without any trouble. What is there to understand?

Here’s something to consider: Look at the picture below and see if you can tell the difference between the Dalmatians and the pictures of chocolate-chip ice cream. If you want to try and do this under time pressure, you can follow the link below the picture to visit sporcle.com and actually get a score. My guess is that you’ll do pretty well.

 

Figure 1 - Visit https://www.sporcle.com/games/sproutcm/dalmatian_not_choc_chip to see if you actually know what a pile of dogs looks like.

But how did you do that? Imagine you have to try and describe to an alien (who knows nothing about dogs or dessert) how to do this task. What’s the trick? My guess is that you might say something like, “Look for eyes.” or “If you see legs and a body, then it’s a Dalmatian.” But both of these kinds of statements push the question down the road without really helping! How do you know something is an eye, a leg, or a body? “Look for fur.” isn’t good advice if you can’t tell me what fur is and what it looks like. So what exactly are you doing when you choose which images are which?

You’re doing something with patterns of light. You’re sensing them (by which we mean something like measuring or encoding them) and you’re perceiving them (by which we mean interpreting what they imply about objects and surfaces in the world). You’re using your eyes and a good bit of your brain to do it. You’re also leveraging a lot of different processes that bridge the gap between a real Dalmatian out in the world and your experience of seeing that Dalmatian – processes that include inferring color, edges, surfaces, volumes, materials, and category labels from patterns of light in a cluttered, complex, and changing world.

This is a lot to understand, but we have two specific things going for us: (1) Compared to other sciences, we get to do experiments by just looking at things. Our visual experiences are our data! (2) We have many quantitative models of human vision that help us account for why we see what we do. (3) We have a range of tools for examining the anatomy of the visual system. We’re going to use all three of these approaches to try and explain how you sense and perceive the world around you using your vision. Along the way, we’ll end up talking a little about physics, a little about physiology, and a good bit about computation, all in service of characterizing what vision is and how it works.

Compared to other descriptions of human vision, this discussion is going to be based as much as possible on observations that are easy to make with accessible materials. At times we’ll have to compromise on this, but for the most part we’ll always try to appeal to your observations to ground what we’re doing in your own visual sense. In large part, this will be formalized in the lab exercises referenced throughout these posts. You can find supporting documents for these exercises at the links provided with each post, but you’ll have to find some of your own materials like lasers, lenses, scissors, etc. Some of these things won’t be too hard to find, but others may be trickier. I’ve included a list of materials and a few links to helpful vendors in case you want to get your own stuff and try these things out.

Besides observation, this discussion is also going to involve describing how to use specific computations to model what the visual system is doing. If you’re not a fan of math, I make no apologies for this: Understanding vision, really understanding it, means understanding how these models work, how to use them to make predictions, and understanding their limits. I believe it’s not enough to say “I get the concepts…” without being able to do some work calculating how specific mechanisms work. Luckily, we’ll stay pretty close to addition, multiplication, and just a tiny bit of trigonometry. The tricky part is understanding why we’re doing the mathematics that we’re doing.

With that, we’re ready to get started! We’ll begin by considering the very first part of seeing: Having something to see in the first place. That is, we’ll begin by talking about light.



Comments

Popular posts from this blog

Lab #4 - Observing retinal inhomgeneities

Lab #4 - Observing retinal inhomgeneities Back-to-back lab activities, but there's a method to the madness: In this set of exercises, you'll make a series of observations designed to show off how your ability to see depends on which part of your retina you're trying to see with. Here's a link to the lab document: https://drive.google.com/file/d/1VwIY1bDNF4CI4CUVaY5WSvQ0HxF9Mn6Y/view When you're done here, we're ready to start saying more about the retina and how it works. Our next posts will be all about developing a model that we can use to describe the retina's contribution to your vision quantitatively, so get ready to calculate some stuff!

Lab #3 - Photopigments

Lab #3 - Photopigments Our next task is to work out how you translate the image formed in the back of a pinhole camera into some kind of signal that your nervous system can work with. We'll start addressing this question by examining photopigments  in Lab #3. To complete this lab, you'll need access to some sunprint paper, which is available from a variety of different sources. Here's where I bought mine:  http://www.sunprints.org . You can find the lab documents at the link below: https://drive.google.com/file/d/17MVZqvyiCRdT_Qu5n_CtK3rVcUP0zoOG/view When you're done, move on to the Lab #4 post to make a few more observations that will give us a little more information about the retina. Afterwards, we'll try to put all of this together into a more comprehensive description of what's happening at the back of the eye.

Color Constancy: Intro

Color Constancy: Estimating object and surface color from the data. In our last post, we introduced a new kind of computation that we said was supposed to help us achieve something called perceptual constancy . That term referred to the ability to maintain some kind of constant response despite a pattern of light that was changing. For example, complex cells in V1 might be able to continue responding the same way to a line or edge that was at different positions in the visual field. This would mean that even when an object changed position over time because you or the object were moving, your complex cells might be able to keep doing the same thing throughout that movement. This is a useful thing to be able to do because your visual world changes a lot as time passes, but in terms of the real objects and surfaces that you’re looking at, the world is pretty stable. Think about it: If you just move your eyes around the room you’re sitting in, your eyes will get very different pattern