Indexing Atoms#

You can get an atom from any container by indexing it, as you would a Python list.

>>> print(mol[0])
Atom( N:1     [ -54.07,   11.27,   41.93] )

would get the first atom in the container, while

>>> print(mol[-1])
Atom( O:11732 [   4.73,    5.03,   46.82] )

gets the last atom.

Passing in an index of an atom that doesn’t exist results in an IndexError exception being raised, e.g.

>>> print(mol[100000])
---------------------------------------------------------------------------
IndexError                                Traceback (most recent call last)
Input In [32], in <cell line: 1>()
----> 1 print(mol[100000])

File ~/sire.app/lib/python3.8/site-packages/sire/mol/__init__.py:414, in __fixed__getitem__(obj, key)
    412         return obj.residue(key)
    413     else:
--> 414         return obj.atom(key)
    415 else:
    416     if __is_chain_class(obj):

IndexError: SireError::invalid_index: No item at index 100000. Index range is from -11728 to 11727. (call Sire.Error.get_last_error_details() for more info)

You can request multiple atoms by using a slice, e.g.

>>> print(mol[0:10])
Selector<SireMol::Atom>( size=10
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
2:  Atom( C:3     [ -56.06,    9.95,   42.55] )
3:  Atom( O:4     [ -57.04,    9.73,   41.82] )
4:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
5:  Atom( CG1:6   [ -55.68,   13.72,   41.72] )
6:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
7:  Atom( CD1:8   [ -55.42,   14.31,   43.09] )
8:  Atom( N:9     [ -55.50,    9.04,   43.36] )
9:  Atom( CA:10   [ -56.02,    7.64,   43.47] )
)

gives the first 10 atoms, while

>>> print(mol[9::-1])
Selector<SireMol::Atom>( size=10
0:  Atom( CA:10   [ -56.02,    7.64,   43.47] )
1:  Atom( N:9     [ -55.50,    9.04,   43.36] )
2:  Atom( CD1:8   [ -55.42,   14.31,   43.09] )
3:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
4:  Atom( CG1:6   [ -55.68,   13.72,   41.72] )
5:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
6:  Atom( O:4     [ -57.04,    9.73,   41.82] )
7:  Atom( C:3     [ -56.06,    9.95,   42.55] )
8:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
9:  Atom( N:1     [ -54.07,   11.27,   41.93] )
)

gives the first 10 atoms in reverse order.

The returned Selector_Atom_ is itself a molecular container, which has all of the same indexing and other functions as Atom et al.

>>> print(mol[0:10][-1:4:-1])
Selector<SireMol::Atom>( size=5
0:  Atom( CA:10   [ -56.02,    7.64,   43.47] )
1:  Atom( N:9     [ -55.50,    9.04,   43.36] )
2:  Atom( CD1:8   [ -55.42,   14.31,   43.09] )
3:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
4:  Atom( CG1:6   [ -55.68,   13.72,   41.72] )
)

gets the 6th to 10th atoms in reverse order.

You can also pass in a list of indicies of atoms.

>>> print(mol[[0, 2, 4, 6, 8]])
Selector<SireMol::Atom>( size=5
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( C:3     [ -56.06,    9.95,   42.55] )
2:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
3:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
4:  Atom( N:9     [ -55.50,    9.04,   43.36] )
)

This gives the atoms at indicies 0, 2, 4, 6, and 8.

This indexing is a shorthand for calling the atom() (for single) and atoms() (for multiple) atom functions, which are available on all of the molecular container types.

You can call these directly, e.g.

>>> print(mol.atom(0))
Atom( N:1     [ -54.07,   11.27,   41.93] )
>>> print(mol.atoms([0, 2, 4, 6, 8]))
Selector<SireMol::Atom>( size=5
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( C:3     [ -56.06,    9.95,   42.55] )
2:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
3:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
4:  Atom( N:9     [ -55.50,    9.04,   43.36] )
)

The atom() function is guaranteed to always return either a single atom, or raise an exception if this is not possible.

The atoms() function will return either a single atom or a selector of atoms, or raise an exception if this is not possible.

Accessing by name#

So far we have been accessing the molecular containers by index, as if they were Python lists. We can also treat the molecular containers like Python dictionaries, and get atoms by their name.

For example, to get the atoms called C we would use

>>> print(mol["C"])
Selector<SireMol::Atom>( size=1494
0:  Atom( C:3     [ -56.06,    9.95,   42.55] )
1:  Atom( C:11    [ -56.14,    7.05,   42.06] )
2:  Atom( C:18    [ -54.61,    4.90,   40.03] )
3:  Atom( C:26    [ -54.80,    2.14,   38.44] )
4:  Atom( C:35    [ -53.58,   -0.34,   36.88] )
...
1489:  Atom( C:11625 [  23.84,   -6.28,   31.63] )
1490:  Atom( C:11630 [  26.57,   -5.11,   30.71] )
1491:  Atom( C:11637 [  28.49,   -2.95,   31.83] )
1492:  Atom( C:11646 [  30.02,   -0.22,   31.11] )
1493:  Atom( C:11654 [  32.09,   -0.82,   34.12] )
)

Note that there are multiple atoms in this molecule called C, hence several are returned. This would raise an exception if you called the shorthand atom() function with this name,

>>> print(mol.atom("C"))
---------------------------------------------------------------------------
KeyError                                  Traceback (most recent call last)
Input In [10], in <cell line: 1>()
----> 1 mol.atom("C")

KeyError: "SireMol::duplicate_atom: More than one atom matches the ID
AtomName('C') (number of matches is 1494).
(call Sire.Error.get_last_error_details() for more info)"

A KeyError exception has been raised because there are multiple atoms in this protein that are called C and Sire does not know which one you want.

In this case, you would have to use the shorthand atoms() function.

>>> print(mol.atoms("C"))
Selector<SireMol::Atom>( size=1494
0:  Atom( C:3     [ -56.06,    9.95,   42.55] )
1:  Atom( C:11    [ -56.14,    7.05,   42.06] )
2:  Atom( C:18    [ -54.61,    4.90,   40.03] )
3:  Atom( C:26    [ -54.80,    2.14,   38.44] )
4:  Atom( C:35    [ -53.58,   -0.34,   36.88] )
...
1489:  Atom( C:11625 [  23.84,   -6.28,   31.63] )
1490:  Atom( C:11630 [  26.57,   -5.11,   30.71] )
1491:  Atom( C:11637 [  28.49,   -2.95,   31.83] )
1492:  Atom( C:11646 [  30.02,   -0.22,   31.11] )
1493:  Atom( C:11654 [  32.09,   -0.82,   34.12] )
)

Note

Using the index operator (mol["C"]) is easiest, as it will always do the right thing. Use the atom() and atoms() functions only when you want to ensure that the container will return atoms.

As before, the returned Selector_Atom_ is itself a container, and so also has its own .atom(), .atoms() and indexing functions, e.g.

>>> print(mol["C"][0])
Atom( C:3     [ -56.06,    9.95,   42.55] )

gives the atom at the index 0 in the container of atoms that are called C, and

>>> print(mol["C"][-1])
Atom( C:11654 [  32.09,   -0.82,   34.12] )

gives the last atom in the container of atoms that are called C.

Asking for an atom that doesn’t exist will result in a KeyError exception being raised.

>>> print(mol["X"])
---------------------------------------------------------------------------
KeyError                                  Traceback (most recent call last)
Input In [24], in <cell line: 1>()
----> 1 print(mol["X"])

File ~/sire.app/lib/python3.8/site-packages/Sire/Mol/__init__.py:419, in __fixed__getitem__(obj, key)
    417     return obj.residues(key)
    418 else:
--> 419     return obj.atoms(key)

File ~/sire.app/lib/python3.8/site-packages/Sire/Mol/__init__.py:428, in __fixed__atoms__(obj, idx)
    426     return obj.__orig__atoms(list(idx))
    427 else:
--> 428     return obj.__orig__atoms(idx)

KeyError: 'SireMol::missing_atom: There is no atom called "X" in the layout "{c4d51f89-f4f7-4e0c-854d-da27affe1baf}". (call Sire.Error.get_last_error_details() for more info)'

Searching by number#

So far, we have identified atoms by their index or by their name. There is a third way to identify atoms. This is by their atom number. The atom number is an identifying number that is assigned to atoms, typically via an atom number column in the input file. We can look up atoms by number using a search string.

>>> print(mol["atomnum 1"])
Atom( N:1     [ -54.07,   11.27,   41.93] )

Here, atomnum 1 is a search string that looks for atoms with number 1.

Note

Note how the name N and number 1 of the atom are printed out, along with its coordinates.

This search string is very powerful. For example, you can search for atoms that have a number that is less than 10.

>>> print(mol["atomnum < 10"])
Selector<SireMol::Atom>( size=9
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
2:  Atom( C:3     [ -56.06,    9.95,   42.55] )
3:  Atom( O:4     [ -57.04,    9.73,   41.82] )
4:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
5:  Atom( CG1:6   [ -55.68,   13.72,   41.72] )
6:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
7:  Atom( CD1:8   [ -55.42,   14.31,   43.09] )
8:  Atom( N:9     [ -55.50,    9.04,   43.36] )
)

You can combine search strings with logical operators (e.g. and, or and not).

>>> print(mol["atomnum >= 5 and atomnum < 10"])
Selector<SireMol::Atom>( size=5
0:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
1:  Atom( CG1:6   [ -55.68,   13.72,   41.72] )
2:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
3:  Atom( CD1:8   [ -55.42,   14.31,   43.09] )
4:  Atom( N:9     [ -55.50,    9.04,   43.36] )
)

and can also search for multiple atom numbers

>>> print(mol["atomnum 1, 3, 5, 7"])
Selector<SireMol::Atom>( size=4
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( C:3     [ -56.06,    9.95,   42.55] )
2:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
3:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
)

or ranges of atom numbers

>>> print(mol["atomnum 1:5"])
Selector<SireMol::Atom>( size=4
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
2:  Atom( C:3     [ -56.06,    9.95,   42.55] )
3:  Atom( O:4     [ -57.04,    9.73,   41.82] )
)

or ranges with steps, e.g.

>>> print(mol["atomnum 1:91:10"])
Selector<SireMol::Atom>( size=9
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( C:11    [ -56.14,    7.05,   42.06] )
2:  Atom( CG:21   [ -54.57,    8.40,   38.40] )
3:  Atom( CE:31   [ -58.18,    0.27,   42.14] )
4:  Atom( C:41    [ -52.69,   -3.63,   36.40] )
5:  Atom( OD1:51  [ -47.78,   -5.70,   36.23] )
6:  Atom( CB:61   [ -52.91,   17.54,   38.36] )
7:  Atom( C:71    [ -56.03,   16.00,   33.41] )
8:  Atom( CB:81   [ -52.48,   16.28,   32.24] )
)

Note

Search number ranges are half-open, like Python ranges. Also note that the results appear in the order the atoms match from their molecular container, not the order of the numbers (range) in the search string. So mol["atomnum 10:1:-1"] would not reverse the atoms. The search string asks for any atoms that have a number between 10 and 1.

You can even mix combinations of multiple atom numbers and ranges!

>>> print(mol["atomnum 1:4, 7:11, 20, 30"])
Selector<SireMol::Atom>( size=9
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
2:  Atom( C:3     [ -56.06,    9.95,   42.55] )
3:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
4:  Atom( CD1:8   [ -55.42,   14.31,   43.09] )
5:  Atom( N:9     [ -55.50,    9.04,   43.36] )
6:  Atom( CA:10   [ -56.02,    7.64,   43.47] )
7:  Atom( CB:20   [ -53.99,    7.18,   39.13] )
8:  Atom( CD:30   [ -57.70,   -0.34,   40.83] )
)

Searching by name#

You can include atom names (atomname) in the search string.

>>> print(mol["atomname CA"])
Selector<SireMol::Atom>( size=1494
0:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
1:  Atom( CA:10   [ -56.02,    7.64,   43.47] )
2:  Atom( CA:17   [ -54.99,    6.39,   39.98] )
3:  Atom( CA:25   [ -55.33,    2.58,   39.80] )
4:  Atom( CA:34   [ -52.97,    1.03,   37.19] )
...
1489:  Atom( CA:11624 [  22.43,   -6.30,   32.21] )
1490:  Atom( CA:11629 [  25.36,   -5.51,   29.89] )
1491:  Atom( CA:11636 [  27.51,   -3.84,   32.59] )
1492:  Atom( CA:11645 [  28.74,   -0.85,   30.58] )
1493:  Atom( CA:11653 [  31.65,   -0.00,   32.91] )
)

can search for multiple atom names

>>> print(mol["atomname CA, C, N"])
Selector<SireMol::Atom>( size=4482
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
2:  Atom( C:3     [ -56.06,    9.95,   42.55] )
3:  Atom( N:9     [ -55.50,    9.04,   43.36] )
4:  Atom( CA:10   [ -56.02,    7.64,   43.47] )
...
4477:  Atom( CA:11645 [  28.74,   -0.85,   30.58] )
4478:  Atom( C:11646 [  30.02,   -0.22,   31.11] )
4479:  Atom( N:11652 [  30.40,   -0.51,   32.35] )
4480:  Atom( CA:11653 [  31.65,   -0.00,   32.91] )
4481:  Atom( C:11654 [  32.09,   -0.82,   34.12] )
)

Note

The atoms will appear in the order they match from their molecular container, not the order the atom names appear in the search string.

and can use mixtures of any identifiers, e.g.

>>> print(mol["atomnum > 1000 and atomname N"])
Selector<SireMol::Atom>( size=1369
0:  Atom( N:1005  [ -30.70,  -11.17,   28.73] )
1:  Atom( N:1014  [ -32.11,  -10.01,   25.69] )
2:  Atom( N:1023  [ -32.43,  -11.41,   22.41] )
3:  Atom( N:1027  [ -32.33,  -11.69,   18.84] )
4:  Atom( N:1038  [ -35.13,  -10.12,   17.49] )
...
1364:  Atom( N:11623 [  22.09,   -7.64,   32.65] )
1365:  Atom( N:11628 [  24.07,   -5.44,   30.61] )
1366:  Atom( N:11635 [  26.39,   -4.38,   31.81] )
1367:  Atom( N:11644 [  28.07,   -1.72,   31.54] )
1368:  Atom( N:11652 [  30.40,   -0.51,   32.35] )
)

Searching by wildcards (glob matching)#

You can use wildcards (glob matching) when searching for atom names.

>>> print(mol["atomname /c*/i"])
Selector<SireMol::Atom>( size=7440
0:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
1:  Atom( C:3     [ -56.06,    9.95,   42.55] )
2:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
3:  Atom( CG1:6   [ -55.68,   13.72,   41.72] )
4:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
...
7435:  Atom( C2:11705 [  10.05,   21.52,   30.16] )
7436:  Atom( C3:11706 [   9.26,   22.77,   30.56] )
7437:  Atom( C4:11708 [   8.69,   22.56,   31.97] )
7438:  Atom( C5:11709 [   7.74,   23.65,   32.46] )
7439:  Atom( C6:11712 [   8.10,   23.00,   29.56] )
)

The wildcard search must be placed between backslashes (/). The i at the end instructs Sire to do a case-insensitive search.

Wildcard searching is very powerful. For example, here we search for all atoms that are called N something number.

>>> print(mol["atomname /N?[0-9]/"])
Selector<SireMol::Atom>( size=378
0:  Atom( NE2:100 [ -58.70,   18.42,   26.32] )
1:  Atom( ND2:157 [ -56.54,    0.31,   26.32] )
2:  Atom( NH1:277 [ -41.15,   21.58,   37.21] )
3:  Atom( NH2:278 [ -39.20,   22.44,   36.34] )
4:  Atom( ND1:285 [ -44.10,   18.72,   31.40] )
...
373:  Atom( NH2:11530 [  11.73,   18.45,   35.06] )
374:  Atom( NE2:11555 [  18.10,    6.67,   34.31] )
375:  Atom( NH1:11573 [  11.37,    5.05,   33.21] )
376:  Atom( NH2:11574 [  12.54,    6.84,   34.03] )
377:  Atom( ND2:11651 [  25.41,    0.07,   29.66] )
)

More information about the glob matching syntax can be found on its wikipedia page.

This is implemented by QRegularExpression’s wildCardToRegularExpression function.

Searching by element#

The search strings are very powerful, and are described in more detail in a later chapter. One cool feature is that you can search for atoms by their element.

>>> print(mol["element C"])
Selector<SireMol::Atom>( size=7440
0:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
1:  Atom( C:3     [ -56.06,    9.95,   42.55] )
2:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
3:  Atom( CG1:6   [ -55.68,   13.72,   41.72] )
4:  Atom( CG2:7   [ -57.70,   12.40,   42.39] )
...
7435:  Atom( C2:11705 [  10.05,   21.52,   30.16] )
7436:  Atom( C3:11706 [   9.26,   22.77,   30.56] )
7437:  Atom( C4:11708 [   8.69,   22.56,   31.97] )
7438:  Atom( C5:11709 [   7.74,   23.65,   32.46] )
7439:  Atom( C6:11712 [   8.10,   23.00,   29.56] )
)

You can use either the element’s symbol, or its full name

>>> print(mol["element nitrogen"])
Selector<SireMol::Atom>( size=2018
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( N:9     [ -55.50,    9.04,   43.36] )
2:  Atom( N:16    [ -55.01,    6.94,   41.35] )
3:  Atom( N:24    [ -55.57,    4.01,   39.78] )
4:  Atom( NZ:32   [ -59.38,   -0.44,   42.67] )
...
2013:  Atom( N:11628 [  24.07,   -5.44,   30.61] )
2014:  Atom( N:11635 [  26.39,   -4.38,   31.81] )
2015:  Atom( N:11644 [  28.07,   -1.72,   31.54] )
2016:  Atom( ND2:11651 [  25.41,    0.07,   29.66] )
2017:  Atom( N:11652 [  30.40,   -0.51,   32.35] )
)

or search for multiple elements at a time

>>> print(mol["element C, O, N"])
Selector<SireMol::Atom>( size=11660
0:  Atom( N:1     [ -54.07,   11.27,   41.93] )
1:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
2:  Atom( C:3     [ -56.06,    9.95,   42.55] )
3:  Atom( O:4     [ -57.04,    9.73,   41.82] )
4:  Atom( CB:5    [ -56.32,   12.33,   41.76] )
...
11655:  Atom( O:11728 [  -4.73,   43.02,   37.18] )
11656:  Atom( O:11729 [  28.91,  -18.15,   61.95] )
11657:  Atom( O:11730 [   5.57,    7.38,   35.58] )
11658:  Atom( O:11731 [  24.49,  -20.60,   42.83] )
11659:  Atom( O:11732 [   4.73,    5.03,   46.82] )
)

Uniquely identifying atoms#

It is often useful to have a unique identifier for an atom in a molecule. The name and number can’t do this, as it is possible that many atoms could have the same name, and/or the same number.

To solve this, Sire uses the index of the atom in the molecule as its unique identifier. This index is called atomidx, and can also be used in searches.

>>> print(mol["atomidx 0"])
Atom( N:1     [ -54.07,   11.27,   41.93] )

This has printed the first atom in the molecule.

You can get an atom’s index using the index() function.

>>> print(mol["atomidx 0"].index())
AtomIdx(0)

Warning

Be careful searching with atomidx. This is the unique index of the atom within its parent molecule, not the index of the atom in a container. So mol[5:10]["atomidx 0"] would raise a KeyError as the first atom in the molecule is not contained in the slice of atoms 5 to 9.

Atom identifying types#

Another way to index atoms is to use the atom indexing types, i.e. AtomIdx, AtomName and AtomNum. The easiest way to create these is by using the function sire.atomid().

>>> print(mol[sr.atomid("CA")])
Selector<SireMol::Atom>( size=1494
0:  Atom( CA:2    [ -55.43,   11.35,   42.54] )
1:  Atom( CA:10   [ -56.02,    7.64,   43.47] )
2:  Atom( CA:17   [ -54.99,    6.39,   39.98] )
3:  Atom( CA:25   [ -55.33,    2.58,   39.80] )
4:  Atom( CA:34   [ -52.97,    1.03,   37.19] )
...
1489:  Atom( CA:11624 [  22.43,   -6.30,   32.21] )
1490:  Atom( CA:11629 [  25.36,   -5.51,   29.89] )
1491:  Atom( CA:11636 [  27.51,   -3.84,   32.59] )
1492:  Atom( CA:11645 [  28.74,   -0.85,   30.58] )
1493:  Atom( CA:11653 [  31.65,   -0.00,   32.91] )
)

This returns the atoms called “CA”, as sr.atomid("CA") has created an AtomName object.

>>> print(sr.atomid("CA"))
AtomName('CA')

This function will create an AtomNum if it is passed an integer, e.g.

>>> print(sr.atomid(1))
AtomNum(1)
>>> print(mol[sr.atomid(1)])
Atom( N:1     [ -54.07,   11.27,   41.93] )

You can set both a name and a number by passing in two arguments, e.g.

>>> print(mol[sr.atomid("CA", 10)])
Atom( CA:10   [ -56.02,    7.64,   43.47] )
>>> print(mol[sr.atomid(name="CA", num=10)])
Atom( CA:10   [ -56.02,    7.64,   43.47] )

Iterating over atoms#

The Selector_Atom_ class is iterable, meaning that it can be used in loops.

>>> for atom in mol["atomnum < 10"]:
...     print(atom)
Atom( N:1     [ -54.07,   11.27,   41.93] )
Atom( CA:2    [ -55.43,   11.35,   42.54] )
Atom( C:3     [ -56.06,    9.95,   42.55] )
Atom( O:4     [ -57.04,    9.73,   41.82] )
Atom( CB:5    [ -56.32,   12.33,   41.76] )
Atom( CG1:6   [ -55.68,   13.72,   41.72] )
Atom( CG2:7   [ -57.70,   12.40,   42.39] )
Atom( CD1:8   [ -55.42,   14.31,   43.09] )
Atom( N:9     [ -55.50,    9.04,   43.36] )

Counting atoms#

You can find all of the names of the atoms using

>>> print(mol.atoms().names())
[AtomName('N'), AtomName('CA'), AtomName('C'), AtomName('O'),
 AtomName('CB'), AtomName('CG1'), AtomName('CG2'), AtomName('CD1'),
 AtomName('N'), AtomName('CA')....]

The set of names used can be found via a Python set, e.g.

>>> print(set(mol.atoms().names()))
{AtomName('CE1'), AtomName('CE2'), AtomName('CE3'), AtomName('OE1'),
 AtomName('OE2'), AtomName('CZ2'), AtomName('CZ3'), AtomName('NE'),
 AtomName('NH1'), AtomName('NH2'), AtomName('ND1'), AtomName('ND2'),
 AtomName('O1'), AtomName('C'), AtomName('O2'), AtomName('O4'),
 AtomName('O3'), AtomName('O5'), AtomName('O7'), AtomName('O6'),
 AtomName('NZ'), AtomName('CG1'), AtomName('N'), AtomName('O'),
 AtomName('CG2'), AtomName('SD'), AtomName('C1'), AtomName('C2'),
 AtomName('SG'), AtomName('C3'), AtomName('C4'), AtomName('C5'),
 AtomName('OG'), AtomName('OG1'), AtomName('OH'), AtomName('NE2'),
 AtomName('NE1'), AtomName('CA'), AtomName('CB'), AtomName('CD'),
 AtomName('CE'), AtomName('C6'), AtomName('CG'), AtomName('CH2'),
 AtomName('CD1'), AtomName('CD2'), AtomName('CZ'), AtomName('OD1'),
 AtomName('OD2')}

You can use this to count the number of atoms that have each name.

>>> for name in set(mol.atoms().names()):
...     print(name, len(mol.atoms(name)))
AtomName('CE1') 128
AtomName('CE2') 110
AtomName('CE3') 24
AtomName('OE1') 139
AtomName('OE2') 107
AtomName('CZ2') 24
AtomName('CZ3') 24
AtomName('NE') 100
AtomName('NH1') 100
AtomName('NH2') 100
AtomName('ND1') 42
AtomName('ND2') 38
AtomName('O1') 6
AtomName('C') 1494
AtomName('O2') 6
AtomName('O4') 6
AtomName('O3') 2
AtomName('O5') 2
AtomName('O7') 2
AtomName('O6') 2
AtomName('NZ') 46
AtomName('CG1') 138
AtomName('N') 1494
AtomName('O') 1512
AtomName('CG2') 226
AtomName('SD') 20
AtomName('C1') 6
AtomName('C2') 6
AtomName('SG') 48
AtomName('C3') 6
AtomName('C4') 6
AtomName('C5') 2
AtomName('OG') 102
AtomName('OG1') 88
AtomName('OH') 22
AtomName('NE2') 74
AtomName('NE1') 24
AtomName('CA') 1494
AtomName('CB') 1426
AtomName('CD') 375
AtomName('CE') 66
AtomName('C6') 2
AtomName('CG') 895
AtomName('CH2') 24
AtomName('CD1') 400
AtomName('CD2') 378
AtomName('CZ') 186
AtomName('OD1') 122
AtomName('OD2') 84

You could do something similar using the numbers() function to get the numbers of all of the atoms.