Our first experiments with Python will take place in Python's "interactive interpreter." This is an interface that allows you to type Python code and see the results of your code instantly. Later on, we'll make Python programs that live in their own little files; but for now, we'll just type simple programs directly into the interpreter.
To launch the Python interactive interpreter, simply type the word python
on the command-line:
$ python
You'll see something that looks like this:
Python 2.6.9 (unknown, Mar 28 2014, 00:06:37)
[GCC 4.8.2 20131212 (Red Hat 4.8.2-7)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>>
The >>>
is the interactive interpreter's prompt. It means, "type some Python please."
As with all good things, Python comes in multiple "versions." The latest version of Python is Python 3, which has a number of design improvements and features. In this class, however, we'll be using Python 2 (specifically, Python 2.6 or newer). Why go with the old version? Mainly because much of the tutorial material available to students on the Internet is still based on Python 2, and many of the third-party libraries that we may want to use don't fully support Python 3 yet. The main reason you need to know this information is that you should be careful when looking up Python information on the Internet---make sure whatever tutorial you're looking at is about Python 2.x, not Python 3.
Let's start with a very high-level description of how computer programming works. When you're writing a computer program, you're describing to the computer what you want, and then asking the computer to figure that thing out for you. Your description of what you want is called an expression. The process that the computer uses to turn your expression into whatever that expression means is called evaluation.
Think of a science fiction movie where a character asks the computer, out loud, "What's the square root of nine billion?" or "How many people older than 50 live in Paris, France?" Those are examples of expressions. The process that the computer uses to transform those expressions into a response is evaluation.
When the process of evaluation is complete, you're left with a single "value". Think of it schematically like so:
What makes computer programs powerful is that they make it possible to write very precise and sophisticated expressions. And importantly, you can embed the results of evaluating one expression inside of another expression, or save the results of evaluating an expression for later in your program.
Unfortunately, computers can't understand and intuit your desires simply from a verbal description. That's why we need computer programming languages: to give us a way to write expressions in a way that the computer can understand. Because programming languages are designed to be precise, they can also be persnickety (and frustrating). And every programming language is different. It's tricky, but worth it.
Let's start with simple arithmetic expressions. The way that you write arithmetic expressions in Python is very similar to the way that you write arithmetic expressions in, say, grade school arithmetic, or algebra. In the example below, 3 + 5
is the expression. In the interactive interpreter, you can tell Python to evaluate the expression and display its value simply by typing in the expression right after the prompt (>>>
).
>>> 3 + 5
Arithmetic expressions in Python can be much more sophisticated than this, of course. We won't go over all of the details right now, but one thing you should know immediately is that Python arithmetic operations are evaluated using the typical order of operations, which you can override with parentheses:
>>> 4 + 5 * 6
>>> (4 + 5) * 6
You can write arithmetic expressions with or without spaces between the numbers and the operators (but usually it's considered better style to include spaces):
>>> 10+20+30
Expressions in Python can also be very simple. In fact, a number on its own is its own expression, which Python evaluates to that number itself:
>>> 19
If you write an expression that Python doesn't understand, then you'll get an error. Here's what that looks like:
>>> + 20 19
You can also ask Python whether two expressions evaluate to the same value, or if one expression evaluates to a value greater than another expression, using a similar familiar syntax. When evaluating such expressions, Python will return one of two special values: either True
or False
.
The ==
operator compares the expression on its left side to the expression on its right side. It evaluates to True
if the values are equal, and False
if they're not equal.
>>> 3 * 5 == 9 + 6
>>> 20 == 7 * 3
The <
operator compares the expression on its left side to the expression on its right side, evaluating to True
if the left-side expression is less than the right-side expression, False
otherwise. The >
does the same thing, except checking to see if the left-side expression is greater than the right-side expression:
>>> 17 < 18
>>> 17 > 18
The >=
and <=
operators translate to "greater than or equal" and "lesser than or equal," respectively:
>>> 22 >= 22
>>> 22 <= 22
Make sure to get the order of the angle bracket and the equal sign right!
>>> 22 =< 22
You can save the result of evaluating an expression for later using the =
operator (called the "assignment operator"). On the left-hand side of the =
, write a word that you'd like to use to refer to the value of the expression, and on the right-hand side, write the expression itself. After you've assigned a value like this, whenever you include that word in your Python code, Python will evaluate the word and replace it with the value you assigned to it earlier. Like so:
>>> x = (4 + 5) * 6
>>> x
(Notice that the line x = (4 + 5) * 6
didn't cause Python to print anything out. An assignment in Python isn't an expression, it's a "statement"---we'll discuss the difference later.)
Now, whenever you use the variable x
in your program, it "stands in" for the result of the expression that you assigned to it.
>>> x / 6
You can create as many variables as you want!
>>> another_variable = (x + 2) * 4
>>> another_variable
Variable names can contain letters, numbers and underscores, but must begin with a letter or underscore. There are other, more technical constraints on variable names; you can review them here.
If you attempt to use a the name of a variable that you haven't defined in the notebook, Python will raise an error:
>>> voldemort
If you assign a value to a variable, and then assign a value to it again, the previous value of the variable will be overwritten:
>>> x = 15
>>> x
>>> x = 42
>>> x
The fact that variables can be overwritten with new values can be helpful in some contexts (e.g., if you're writing a program and you're using the variable to keep track of some value that changes over time). But it can also be annoying if you use the same variable name twice on accident and overwrite values in one part of your program that another part of your program is using the same variable name to keep track of!
Another important thing to know is that when Python evaluates an expression, it assigns the result to a "type." A type is a description of what kind of thing a value is, and Python uses that information to determine later what you can do with that value, and what kinds of expressions that value can be used in. You can ask Python what type it thinks a particular expression evaluates to, or what type a particular value is, using the type()
function:
>>> type(100 + 1)
The word int stands for "integer." ("Integers" are numbers that don't have a fractional component, i.e., -2, -1, 0, 1, 2, etc.) Python has many, many other types, and lots of (sometimes arcane) rules for how those types interact with each other when used in the same expression. For example, you can create a floating point type (i.e., a number with a decimal point in it) by writing a number with a decimal point in it:
>>> type(3.14)
Interestingly, the result of adding a floating-point number and an integer number together is always a floating point number:
>>> type(3.14 + 17)
... and the result of dividing one integer by another integer is always an integer:
>>> type(4 / 3)
Throwing an expression into the type()
function is a good way to know whether or not the value you're working with is the value you were expecting to work with. We'll use it for debugging some example code later.
Another type of value in Python is called a "string." Strings are a way of representing in our computer programs stretches of text: one or more letters in sequential order. To make an expression that evaluates to a string in Python, simply enclose some text inside of quotes and put it into the interactive interpreter:
>>> "Suppose there is a pigeon, suppose there is."
Asking Python for the type of a string returns str
:
>>> type("Suppose there is a pigeon, suppose there is.")
You can use single quotes or double quotes to enclose strings (I tend to use them interchangeably), as long as the opening quote matches the closing quote:
>>> 'Suppose there is a pigeon, suppose there is.'
(When you ask Python to evaluate a string expression, it will display it with single quotes surrounding it.)
You can assign strings to variables, just like any other value:
>>> roastbeef = "Suppose there is a pigeon, suppose there is."
>>> roastbeef
Normally, if there are any characters you want in your string, all you have to do to put them there is type the characters in on your keyboard, or paste in the text that you want from some other source. There are some characters, however, that require special treatment and can't be typed into a string directly.
For example, say you have a double-quoted string. Now, the rules about quoting strings (as outlined above) is that the quoted string begins with a double-quote character and ends with a double-quote character. But what if you want to include a double-quote character INSIDE the string? You might think you could do this:
>>> "And then he said, "I think that's a cool idea," and vanished."
But that won't work:
>>> "And then he said, "I think that's a cool idea," and vanished."
It doesn't work because Python interprets the first double-quote it sees after the beginning of the string as the double-quote that marks the end of the string. Then it sees all of the stuff after the string and says, "okay, the programmer must not be having a good day?" and displays a syntax error. Clearly, we need a way to tell Python "I want you to interpret this character not with the special meaning it has in Python, but LITERALLY as the thing that I typed."
We can do this exact thing by putting a backslash in front of the characters that we want Python to interpret literally, like so:
>>> "And then he said, \"I think that's a cool idea,\" and vanished."
A character indicated in this way is called an "escape" character (because you've "escaped" from the typical meaning of the character). There are several other useful escape characters to know about:
\"
above, but you can also use \'
in a single-quoted string.\n
if you want to include a new line in your string.\t
instead of hitting the tab key to put a tab in your string.\
is itself the character used to escape other characters, you need to type \\
if you actually want a backslash in your string.You might have noticed that when you evaluate a string as an expression in the interactive interpreter, Python displays the string with quotes around it, and with escape characters intact, like so:
>>> "\tA \"string\" with escape\ncharacters."
By default, the interactive interpreter doesn't "interpolate" escape characters in strings when it displays them. ("Interpolate" is a fancy computer programming term that means "replace symbols in something with whatever those symbols represent.") Python does this so that the result of evaluating an expression is itself a valid expression that you can cut-and-paste right back into the interactive interpreter.
If you want Python to actually display \t
as a tab, and \n
as a new line, use the print
statement, like this:
>>> print "\tA \"string\" with escape\ncharacters."
(We'll be using the print
statement a LOT in this class, especially when we get to standalone scripts outside of the interactive interpreter.)
Sometimes we want to work with strings that have more than one "line" of text in them. The problem with this is that Python interprets your having pressed "Enter" with your having finished your input, so if you try to cut-and-paste in some text with new line characters, you'll get an error:
>>> poem = "Rose, harsh rose,
marred and with stint of petals,
meagre flower, thin,
spare of leaf,"
(EOL while scanning string literal
is Python's way of saying "you hit enter too soon.") One way to work around this is to include \n
(newline character) inside the string when we type it into our program:
>>> poem = "Rose, harsh rose,\nmarred and with stint of petals,\nmeagre flower, thin,\nspare of leaf,"
>>> print poem
This works, but it's kind of inconvenient! A better solution is to use a different way of quoting strings in Python, the triple-quote. It looks like this:
>>> poem = """Rose, harsh rose,
marred and with stint of petals,
meagre flower, thin,
spare of leaf,"""
>>> print poem
When you use three quotes instead of one, Python allows you to put new line characters directly into the string. Nice! We'll be using this for some of the examples below.
Now that we can get some text into our program, let's talk about some of the ways Python allows us to do interesting things with that text.
Let's talk about the len()
function first. If you take an expression that evaluates to a string and put it inside the parentheses of len()
, you get an integer value that indicates how long the string is. Like so:
>>> len("Suppose there is a pigeon, suppose there is.")
The value that len()
evaluates to can itself be used in other expressions (just like any other value!):
>>> len("Camembert") + len("Cheddar")
Next up: the in
operator, which lets us check to see if a particular string is found inside of another string.
"foo" in "buffoon"
"foo" in "reginald"
The in
operator takes one expression evaluating to a string on the left and another on the right, and returns True
if the string on the left occurs somewhere inside of the string on the right.
We can check to see if a string begins with or ends with another string using that string's .startswith()
and .endswith()
methods, respectively:
>>> "foodie".startswith("foo")
>>> "foodie".endswith("foo")
The .isdigit()
method returns True
if Python thinks the string could represent an integer, and False
otherwise:
>>> "foodie".isdigit()
>>> "4567".isdigit()
And the .islower()
and .isupper()
methods return True
if the string is in all lower case or all upper case, respectively (and False
otherwise).
print "foodie".islower()
print "foodie".isupper()
print "YELLING ON THE INTERNET".islower()
print "YELLING ON THE INTERNET".isupper()
The in
operator discussed above will tell us if a substring occurs in some other string. If we want to know where that substring occurs, we can use the .find()
method. The .find()
method takes a single parameter between its parentheses: an expression evaluating to a string, which will be searched for within the string whose .find()
method was called. If the substring is found, the entire expression will evaluate to the index at which the substring is found. If the substring is not found, the expression evaluates to -1
. To demonstrate:
>>> "Now is the winter of our discontent".find("win")
>>> "Now is the winter of our discontent".find("lose")
The .count()
method will return the number of times a particular substring is found within the larger string:
>>> "I got rhythm, I got music, I got my man, who could ask for anything more".count("I got")
Finally, remember the ==
operator that we discussed earlier? You can use that in Python to check to see if two strings contain the same characters in the same order:
>>> "pants" == "pants"
>>> "pants" == "trousers"
Exercise: Create a variable called
poem
and assign the text of "Sea Rose" to that variable. Use thelen()
function to find out how many characters are in it. Then, use thecount()
method to find out how many times the stringrose
occurs within it.
Python strings have a number of different methods which, when called on a string, return a copy of that string with a simple transformation applied to it. These are helpful for normalizing and cleaning up data, or preparing it to be displayed.
Let's start with .lower()
, which evaluates to a copy of the string in all lower case:
"ARGUMENTATION! DISAGREEMENT! STRIFE!".lower()
The converse of .lower()
is .upper()
:
"e.e. cummings is. not. happy about this.".upper()
The method .title()
evaluates to a copy of the string it's called on, replacing every letter at the beginning of a word in the string with a capital letter:
"dr. strangelove, or, how I learned to love the bomb".title()
The .strip()
method removes any whitespace from the beginning or end of the string (but not between characters later in the string):
" got some random whitespace in some places here ".strip()
Finally, the .replace()
method takes two parameters: a string to find, and a string to replace that string with whenever it's found. You can use this to make sad stories.
"I got rhythm, I got music, I got my man, who could ask for anything more".replace("I got", "I used to have")
EXERCISE: Use the
.replace()
method to make "Sea Rose" about something else entirely. (Hint: try using the.replace()
method more than once!)
Okay, we're getting somewhere together! But I've still been using a lot of jargon when explaning this stuff. One thing that might confuse you: what's a "function" and what's a "method"?
We've talked about two "functions" so far: len()
and type()
. A function is a special word that you can use in Python expressions that runs some pre-defined code: you put your expression inside the parentheses, and Python sends the result of evaluating that expression to the code in the function. That code operates on the value that you gave it, and then itself evaluates to another value. Using a function in this way is usually called "calling" it or "invoking" it. The stuff that you put inside the parentheses is called a "parameter" or "argument"; the value that the function gives back is called its "return value."
The len()
and type()
functions are two of what are called "built-in functions," i.e. functions that come with Python and are available whenever you're writing Python code. In Python, built-in functions tend to be able to take many different types of value as parameters. (There are a lot of other built-in functions, not just len()
and type()
! We'll discuss them as the need arises.)
NOTE: You can also write your own functions---we'll learn how to do this later in the class. Writing functions is a good way to avoid repetition in your code and to compartmentalize it.)
"Methods" work a lot like functions, except in how it looks when you use them. Instead of putting the expression that you want to use them with inside the parentheses, you put the call to the method directly AFTER the expression that you want to call it on, following a period (.
). Methods, unlike built-in functions, are usually only valid for one type of value; e.g., values of the string type have a .strip()
method, but integer values don't.
It's important to remember that methods can be called both on an expression that evaluates to a particular value AND on a variable that contains that value. So you can do this:
>>> "hello".find('e')
...and this:
>>> s = "hello"
>>> s.find('e')
The interactive interpreter has all kinds of nuggets to help you program in Python. The first one worth mentioning is the help()
function:
>>> help()
Typing help()
starts Python's interactive "help utility," which gives you access to a number of tutorials and other helpful information. It's worth playing with for a while! To quit the help utility, type quit
.
You can also pass a parameter to help()
. If you use it this way, help()
will display a small snippet of explanatory text about whatever value you passed into the function. For example, to get help about the len()
function (make sure not to put the parentheses after len
):
>>> help(len)
(Press q
to dismiss this help screen.)
dir()
Remember above when we were talking about how certain types of value have certain "methods" that you can only use with that type of value? Sometimes it's helpful to be reminded of exactly which methods an object supports. You can find this out right in the interactive interpreter without having to look it up in the documentation using the dir()
built-in function. Just pass the value that you want to know more about to dir()
:
>>> dir("hello")
This is a list of all of the methods that the string type supports. (Ignore anything that begins with two underscores (__
) for now---those are special weird built-in methods that aren't very useful to call on their own.) If you want to know more about one method in particular, you can type this (note again that you need to NOT include the parentheses after the method):
>>> help("hello".swapcase)
Hey awesome! We've learned something about another string method. Let's try this method out:
>>> "Star Wars Episode IV: A New Hope".swapcase()
EXERCISE: Use
dir()
andhelp()
to find and research a string method that isn't mentioned in the notes. Then write an expression using that method.
Python has some powerful language constructions that allow you to access parts of the string by their numerical position in the string. You can get an individual character of a string by putting square brackets ([]
) right after an expression that evaluates to a string, and putting inside the square brackets the number that represents which character you want. Here's an example:
>>> "bungalow"[2]
You can also do this with variables that contain string values, of course:
>>> message = "bungalow"
>>> message[2]
If we were to say this expression out loud, it might read, "I have a string, consisting of the characters b
, u
, n
, g
, a
, l
, o
and w
, in that order. Give me back the second item in that string." Python evaluates that expression to n
, which is indeed the second letter in the word "bungalow."
You're right---good catch. But for reasons too complicated to go into here, Python (along with many other programming languages!) starts counting at 0, instead of 1. So what looks like the third letter of the string to human eyes is actually the second letter to Python. The first letter of the string is accessed using index 0, like so:
>>> message[0]
The way I like to conceptualize this is to think of list indexes not as specifying the number of the item you want, but instead specifying how "far away" from the beginning of the list to look for that value.
If you attempt to use a value for the index of a list that is beyond the end of the list (i.e., the value you use is higher than the last index in the list), Python gives you an error:
>>> message[17]
An individual character from a string still has the same type as the string it came from:
>>> type(message[3])
And, of course, a string containing an individual character has a length of 1:
>>> len(message[3])
The thing that goes inside of the index brackets doesn't have to be a number that you've just typed in there. Any Python expression that evaluates to an integer can go in there.
>>> message[6 / 2]
>>> x = 3
>>> message[x]
If you use -1
as the value inside of the brackets, something interesting happens:
>>> message[-1]
The expression evaluates to the last character in the string. This is essentially the same thing as the following code:
>>> message[len(message) - 1]
... except easier to write. In fact, you can use any negative integer in the brackets, and Python will count that many items from the end of the string, and the expression evaluates to that item.
>>> message[-3]
If the value in the brackets would "go past" the beginning of the list, Python will raise an error:
>>> message[-987]
The index bracket syntax explained above allows you to write an expression that evaluates to a character in a string, based on its position in the string. Python also has a powerful way for you to write expressions that return a section of a string, starting from a particular index and ending with another index. In Python parlance we'll call this section a slice.
Writing an expression to get a slice of a string looks a lot like writing an expression to get a single character. The difference is that instead of putting one number between square brackets, we put two numbers, separated by a colon. The first number tells Python where to begin the slice, and the second number tells Python where to end it.
>>> message[1:4]
Note that the value after the colon specifies at which index the slice should end, but the slice does not include the value at that index. I would translate the expression above as saying "give me characters one through four of the string in the "message" variable, NOT INCLUDING character four."
The fact that slice indexes aren't inclusive means that you can tell how long the slice will be by subtracting the value before the colon from the value after it:
>>> message[1:4]
>>> len(message[1:4])
>>> 4 - 1
Also note that---as always!---any expression that evaluates to an integer can be used for either value in the brackets. For example:
>>> x = 3
>>> message[x:x+2]
Finally, note that the type of a slice is still str
:
>>> type(message[5:7])
Because it's so common to use the slice syntax to get a string that is either a slice starting at the beginning of the string or a slice ending at the end of the string, Python has a special shortcut. Instead of writing:
>>> message[0:3]
You can leave out the 0
and write this instead:
>>> message[:3]
Likewise, if you wanted a slice that starts at index 4 and goes to the end of the string, you might write:
>>> message[4:]
Now for some tricky stuff: You can use negative index values in slice brackets as well! For example, to get a slice of a string from the fourth-to-last element of the string up to (but not including) the second-to-last element of the string:
>>> message[-4:-2]
(Even with negative slice indexes, the numbers have the property that subtracting the first from the second yields the length of the slice, i.e. -2 - (-4)
is 2
).
To get the last three elements of the string:
>>> message[:-3]
EXERCISE: Write an expression, or a series of expressions, that prints out "Sea Rose" from the first occurence of the string
sand
up until the end of the poem. (Hint: Use the.find()
method, discussed above.)
Earlier, we discussed how the +
operator can be used to create an expression that evaluates to the sum of two numbers. E.g.:
>>> 17 + 92
The +
operator can also be used to create a new string from two other strings. This is called "concatenation":
>>> "Spider" + "man"
>>> part1 = "Nickel, what is nickel, "
>>> part2 = "it is originally rid of a cover."
>>> print part1 + part2
You can combine as many strings as you want this way, using the +
operator multiple times in the same expression:
>>> "bas" + "ket" + "ball"
EXERCISE: Write an expression that evaluates to a string containing the first fifty characters of "Sea Rose" followed by the last fifty characters of "Sea Rose."
It's important to remember that a string that contains what looks like a number does not behave like an actual integer or floating point number does. For example, attempting to subtract one string containing a number from another string containing a number will cause an error to be raised:
>>> "15" - "4"
The "unsupported operand type(s)" error means that you tried to use an operator (in this case +
) with two types that the operator in question doesn't know how to work with. (Python is saying: "You asked me to subtract a string from another string. That doesn't make sense to me.")
Attempting to add an integer or floating-point number to a string that has (what looks like) a number inside of it will raise a similar error:
>>> 16 + "8.9"
Fortunately, there are built-in functions whose purpose is to convert from one type to another; notably, you can put a string inside the parentheses of the int()
and float()
functions, and it will evaluate to (what Python interprets as) the integer and floating-point values (respectively) of the string:
>>> type("17")
>>> int("17")
>>> type(int("17"))
>>> type("3.14159")
>>> float("3.14159")
>>> type(float("3.14159"))
If you give a string to one of these functions that Python can't interpret as an integer or floating-point number, Python will raise an error:
>>> int("shumai")
This section introduces many of the basic building blocks you'll need in order to use computer programs to write poems. We've talked about how to use the interactive interpreter, and about expressions and values, and about the distinction between functions and methods; and we've discussed the details of how strings work and how to manipulate them.
Further reading: