-
Notifications
You must be signed in to change notification settings - Fork 290
WhatsNew–Python3.0
Lamparter edited this page Nov 18, 2024
·
1 revision
Important
This file is used by maintainers to track implementation of core functionality from Python 3 into IronPython.
https://docs.python.org/3/whatsnew/3.0.html
-
dict
methodsdict.keys()
,dict.items()
anddict.values()
return "views" instead of lists. For example, this no longer works:k = d.keys(); k.sort()
. Usek = sorted(d)
instead (this works in Python 2.5 too and is just as efficient). - Also, the
dict.iterkeys()
,dict.iteritems()
anddict.itervalues()
methods are no longer supported. -
map()
andfilter()
return iterators. If you really need a list, a quick fix is e.g.list(map(...))
, but a better fix is often to use a list comprehension (especially when the original code uses lambda), or rewriting the code so it doesn't need a list at all. Particularly tricky ismap()
invoked for the side effects of the function; the correct transformation is to use a regular for loop (since creating a list would just be wasteful). -
zip()
now returns an iterator.
- The ordering comparison operators (
<
,<=
,>=
,>
) raise aTypeError
exception when the operands don't have a meaningful natural ordering. Thus, expressions like1 < ''
,0 > None
orlen <= len
are no longer valid, and e.g.None < None
raisesTypeError
instead of returningFalse
. A corollary is that sorting a heterogeneous list no longer makes sense - all the elements must be comparable to each other. Note that this does not apply to the==
and!=
operators: objects of different incomparable types always compare unequal to each other. -
builtin.sorted()
andlist.sort()
no longer accept thecmp
argument providing a comparison function. Use thekey
argument instead. N.B. thekey
andreverse
arguments are now "keyword-only" - The
cmp()
function should be treated as gone, and the__cmp__()
special method is no longer supported. Use__lt__()
for sorting,__eq__()
with__hash__()
, and other rich comparisons as needed. (If you really need thecmp()
functionality, you could use the expression(a > b) - (a < b)
as the equivalent forcmp(a, b)
.)
- PEP 0237: Essentially,
long
renamed toint
. That is, there is only one built-in integral type, namedint
; but it behaves mostly like the oldlong
type. - PEP 0238: An expression like
1/2
returns afloat
. Use1//2
to get the truncating behavior. (The latter syntax has existed for years, at least since Python 2.2.) - The
sys.maxint
constant was removed, since there is no longer a limit to the value of integers. However,sys.maxsize
can be used as an integer larger than any practical list or string index. It conforms to the implementation's "natural" integer size and is typically the same assys.maxint
in previous releases on the same platform (assuming the same build options). - The
repr()
of along
integer doesn't include the trailingL
anymore, so code that unconditionally strips that character will chop off the last digit instead. (Usestr()
instead.) - Octal literals are no longer of the form
0720
; use0o720
instead.
- Python 3.0 uses the concepts of text and (binary) data instead of Unicode strings and 8-bit strings. All text is Unicode; however encoded Unicode is represented as binary data. The type used to hold text is
str
, the type used to hold data isbytes
. The biggest difference with the 2.x situation is that any attempt to mix text and data in Python 3.0 raisesTypeError
, whereas if you were to mix Unicode and 8-bit strings in Python 2.x, it would work if the 8-bit string happened to contain only 7-bit (ASCII) bytes, but you would getUnicodeDecodeError
if it contained non-ASCII values. This value-specific behavior has caused numerous sad faces over the years. - As a consequence of this change in philosophy, pretty much all code that uses Unicode, encodings or binary data most likely has to change. The change is for the better, as in the 2.x world there were numerous bugs having to do with mixing encoded and unencoded text. To be prepared in Python 2.x, start using
unicode
for all unencoded text, andstr
for binary or encoded data only. Then the2to3
tool will do most of the work for you. -
You can no longer use(Readded in Python 3.3). However, you must useu"..."
literals for Unicode text.b"..."
literals for binary data. - As the
str
andbytes
types cannot be mixed, you must always explicitly convert between them. Usestr.encode()
to go fromstr
tobytes
, andbytes.decode()
to go frombytes
tostr
. You can also usebytes(s, encoding=...)
andstr(b, encoding=...)
, respectively. - Like
str
, thebytes
type is immutable. There is a separate mutable type to hold buffered binary data,bytearray
. Nearly all APIs that acceptbytes
also acceptbytearray
. The mutable API is based oncollections.MutableSequence
. - All backslashes in raw string literals are interpreted literally. This means that
'\U'
and'\u'
escapes in raw strings are not treated specially. For example,r'\u20ac'
is a string of 6 characters in Python 3.0, whereas in 2.6,ur'\u20ac'
was the single "euro" character. (Of course, this change only affects raw string literals; the euro character is'\u20ac'
in Python 3.0.) - The builtin
basestring
abstract type was removed. Usestr
instead. Thestr
andbytes
types don't have functionality enough in common to warrant a shared base class. The2to3
tool (see below) replaces every occurrence ofbasestring
withstr
. - Files opened as text files (still the default mode for
open()
) always use an encoding to map between strings (in memory) and bytes (on disk). Binary files (opened with ab
in the mode argument) always use bytes in memory. This means that if a file is opened using an incorrect mode or encoding, I/O will likely fail loudly, instead of silently producing incorrect data. It also means that even Unix users will have to specify the correct mode (text or binary) when opening a file. There is a platform-dependent default encoding, which on Unixy platforms can be set with theLANG
environment variable (and sometimes also with some other platform-specific locale-related environment variables). In many cases, but not all, the system default is UTF-8; you should never count on this default. Any application reading or writing more than pure ASCII text should probably have a way to override the encoding. There is no longer any need for using the encoding-aware streams in thecodecs
module. - Filenames are passed to and returned from APIs as (Unicode) strings. This can present platform-specific problems because on some platforms filenames are arbitrary byte strings. (On the other hand, on Windows filenames are natively stored as Unicode.) As a work-around, most APIs (e.g.
open()
and many functions in theos
module) that take filenames accept bytes objects as well as strings, and a few APIs have a way to ask for abytes
return value. Thus,os.listdir()
returns a list ofbytes
instances if the argument is abytes
instance, andos.getcwdb()
returns the current working directory as abytes
instance. Note that whenos.listdir()
returns a list of strings, filenames that cannot be decoded properly are omitted rather than raisingUnicodeError
. - Some system APIs like
os.environ
andsys.argv
can also present problems when the bytes made available by the system is not interpretable using the default encoding. Setting theLANG
variable and rerunning the program is probably the best approach. - PEP 3138: The
repr()
of a string no longer escapes non-ASCII characters. It still escapes control characters and code points with non-printable status in the Unicode standard, however. - PEP 3120: The default source encoding is now UTF-8.
- PEP 3131: Non-ASCII letters are now allowed in identifiers. (However, the standard library remains ASCII-only with the exception of contributor names in comments.)
- The
StringIO
andcStringIO
modules are gone. Instead, import theio
module and useio.StringIO
orio.BytesIO
for text and data respectively.
- PEP 3107: Function argument and return value annotations. This provides a standardized way of annotating a function's parameters and return value. There are no semantics attached to such annotations except that they can be introspected at runtime using the
__annotations__
attribute. The intent is to encourage experimentation through metaclasses, decorators or frameworks. - PEP 3102: Keyword-only arguments. Named parameters occurring after
*args
in the parameter list must be specified using keyword syntax in the call. You can also use a bare*
in the parameter list to indicate that you don't accept a variable-length argument list, but you do have keyword-only arguments. - Keyword arguments are allowed after the list of base classes in a class definition. This is used by the new convention for specifying a metaclass (see next section), but can be used for other purposes as well, as long as the metaclass supports it.
- PEP 3104:
nonlocal
statement. Usingnonlocal x
you can now assign directly to a variable in an outer (but non-global
) scope.nonlocal
is a new reserved word. - PEP 3132: Extended Iterable Unpacking. You can now write things like
a, b, *rest = some_sequence
. And even*rest, a = stuff
. Therest
object is always a (possibly empty) list; the right-hand side may be any iterable - Dictionary comprehensions:
{k: v for k, v in stuff}
means the same thing asdict(stuff)
but is more flexible. (This is PEP 0274 vindicated. :-) - Set literals, e.g.
{1, 2}
. Note that{}
is an empty dictionary; useset()
for an empty set. Set comprehensions are also supported; e.g.,{x for x in stuff}
means the same thing asset(stuff)
but is more flexible. - New octal literals, e.g.
0o720
(already in 2.6). The old octal literals (0720
) are gone. - New binary literals, e.g.
0b1010
(already in 2.6), and there is a new corresponding builtin function,bin()
. - Bytes literals are introduced with a leading
b
orB
, and there is a new corresponding builtin function,bytes()
.
- PEP 3109 and PEP 3134: new
raise
statement syntax:raise [expr [from expr]]
. -
as
andwith
are now reserved words. (Since 2.6, actually.) -
True
,False
, andNone
are reserved words. (2.6 partially enforced the restrictions onNone
already.) - Change from
except exc, var
toexcept exc as var
. See PEP 3110 - PEP 3115: New Metaclass Syntax
- List comprehensions no longer support the syntactic form
[... for var in item1, item2, ...]
. Use[... for var in (item1, item2, ...)]
instead. - The ellipsis (
...
) can be used as an atomic expression anywhere. (Previously it was only allowed in slices.) Also, it must now be spelled as...
. (Previously it could also be spelled as. . .
, by a mere accident of the grammar.)
- PEP 3113: Tuple parameter unpacking removed. You can no longer write
def foo(a, (b, c)): ....
Usedef foo(a, b_c): b, c = b_c
instead. - Removed backticks (use
repr()
instead). - Removed
<>
(use!=
instead). - Removed keyword:
exec()
is no longer a keyword; it remains as a function - Integer literals no longer support a trailing
l
orL
. -
String literals no longer support a leading(Readded in Python 3.3)u
orU
. - The
from module import *
syntax is only allowed at the module level, no longer inside functions. - The only acceptable syntax for relative imports is
from .[module] import name
. All import forms not starting with.
are interpreted as absolute imports. (PEP 0328) - Classic classes are gone.
-
_winreg
renamed towinreg
-
copy_reg
renamed tocopyreg
- Cleanup of the
sys
module: removedsys.exitfunc()
,sys.exc_clear()
,sys.exc_type
,sys.exc_value
,sys.exc_traceback
. - Cleanup of the
array.array
type: theread()
andwrite()
methods are gone; usefromfile()
andtofile()
instead. Also, the'c'
typecode for array is gone - use either'b'
for bytes or'u'
for Unicode characters. - Cleanup of the operator module: removed
sequenceIncludes()
andisCallable()
. - Cleanup of the
thread
module:acquire_lock()
andrelease_lock()
are gone; useacquire()
andrelease()
instead. - Cleanup of the
random
module: removed thejumpahead()
API. - The functions
os.tmpnam()
,os.tempnam()
andos.tmpfile()
have been removed in favor of thetempfile
module. -
string.letters
and its friends (string.lowercase
andstring.uppercase
) are gone. Usestring.ascii_letters
etc. instead. (The reason for the removal is thatstring.letters
and friends had locale-specific behavior, which is a bad idea for such attractively-named global "constants".)
PEP 3101: A New Approach To String Formatting
- A new system for built-in string formatting operations replaces the
%
string formatting operator. (However, the%
operator is still supported; it will be deprecated in Python 3.1 and removed from the language at some later time.) Read PEP 3101 for the full scoop.
- PEP 0352: All exceptions must be derived (directly or indirectly) from
BaseException
. This is the root of the exception hierarchy. This is not new as a recommendation, but the requirement to inherit fromBaseException
is new. (Python 2.6 still allowed classic classes to be raised, and placed no restriction on what you can catch.) As a consequence, string exceptions are finally truly and utterly dead. - Almost all exceptions should actually derive from
Exception
;BaseException
should only be used as a base class for exceptions that should only be handled at the top level, such asSystemExit
orKeyboardInterrupt
. The recommended idiom for handling all exceptions except for this latter category is to useexcept Exception
. -
StandardError
was removed (in 2.6 already). - Exceptions no longer behave as sequences. Use the
args
attribute instead. - PEP 3109: Raising exceptions. You must now use
raise Exception(args)
instead ofraise Exception, args
. Additionally, you can no longer explicitly specify a traceback; instead, if you have to do this, you can assign directly to the__traceback__
attribute (see below). - PEP 3110: Catching exceptions. You must now use
except SomeException as variable
instead ofexcept SomeException, variable
. Moreover, the variable is explicitly deleted when theexcept
block is left. - PEP 3134: Exception chaining. There are two cases: implicit chaining and explicit chaining. Implicit chaining happens when an exception is raised in an
except
orfinally
handler block. This usually happens due to a bug in the handler block; we call this a secondary exception. In this case, the original exception (that was being handled) is saved as the__context__
attribute of the secondary exception. Explicit chaining is invoked with this syntax:raise SecondaryException() from primary_exception
(whereprimary_exception
is any expression that produces an exception object, probably an exception that was previously caught). In this case, the primary exception is stored on the__cause__
attribute of the secondary exception. The traceback printed when an unhandled exception occurs walks the chain of__cause__
and__context__
attributes and prints a separate traceback for each component of the chain, with the primary exception at the top. (Java users may recognize this behavior.) - PEP 3134: Exception objects now store their traceback as the
__traceback__
attribute. This means that an exception object now contains all the information pertaining to an exception, and there are fewer reasons to usesys.exc_info()
(though the latter is not removed). - A few exception messages are improved when Windows fails to load an extension module. For example, error code 193 is now
%1 is not a valid Win32 application
. Strings now deal with non-English locales.
-
!=
now returns the opposite of==
, unless==
returnsNotImplemented
. - The concept of "unbound methods" has been removed from the language. When referencing a method as a class attribute, you now get a plain function object.
-
__getslice__()
,__setslice__()
and__delslice__()
were killed. The syntaxa[i:j]
now translates toa.__getitem__(slice(i, j))
(or__setitem__()
or__delitem__()
, when used as an assignment or deletion target, respectively). - PEP 3114: the standard
next()
method has been renamed to__next__()
. - The
__oct__()
and__hex__()
special methods are removed -oct()
andhex()
use__index__()
now to convert the argument to an integer. - Removed support for
__members__
and__methods__
. - The function attributes named
func_X
have been renamed to use the__X__
form, freeing up these names in the function attribute namespace for user-defined attributes. To wit,func_closure
,func_code
,func_defaults
,func_dict
,func_doc
,func_globals
,func_name
were renamed to__closure__
,__code__
,__defaults__
,__dict__
,__doc__
,__globals__
,__name__
, respectively. -
__nonzero__()
is now__bool__()
.
- PEP 3135: New
super()
. You can now invokesuper()
without arguments and (assuming this is in a regular instance method defined inside a class statement) the right class and instance will automatically be chosen. With arguments, the behavior ofsuper()
is unchanged. - PEP 3111:
raw_input()
was renamed toinput()
. That is, the newinput()
function reads a line fromsys.stdin
and returns it with the trailing newline stripped. It raisesEOFError
if the input is terminated prematurely. To get the old behavior ofinput()
, useeval(input())
. - A new builtin
next()
was added to call the__next__()
method on an object. - Moved
intern()
tosys.intern()
- Removed:
apply()
. Instead ofapply(f, args)
usef(*args)
. -
Removed(Readded in Python 3.2) Thecallable()
. Instead ofcallable(f)
you can usehasattr(f, '__call__')
.operator.isCallable()
function is also gone. - Removed
coerce()
. This function no longer serves a purpose now that classic classes are gone. - Removed
execfile()
. Instead ofexecfile(fn)
useexec(open(fn).read())
. - Removed
file
. Useopen()
. - Removed
reduce()
. Usefunctools.reduce()
- Removed
reload()
. Useimp.reload()
. - Removed
dict.has_key()
- use thein
operator instead.
Still looking for more? Send a message by creating a feature request on the Issues tab.
🐍 IronPython