import numpy as np

from qiskit import QuantumCircuit
from qiskit_aer import AerSimulator
from qiskit.quantum_info import Operator

from util import zero, one, draw_vecs

sim = AerSimulator()

Appendix: Linear Algebra#

We’ll review the bare minimum of linear algebra needed for this series of notebooks. In particular, we will

  1. introduce vectors as a data structure that encodes the quantum state of single- and multi-qubit systems, and

  2. introduce matrices as functions that implement quantum transformations.

Vectors#

From a programming perspective, we can think of a vector as a data structure (i.e., an array) with operations (\(+\), \(\cdot\)) defined on it that satisfy certain properties (e.g., \(\cdot\) distributes over \(+\)).

Real Vectors#

vec1 = np.array([1., 2.])   # D = 2
vec1
array([1., 2.])
vec1[0], vec1[1]
(np.float64(1.0), np.float64(2.0))
draw_vecs([vec1])
vec2 = np.array([2.3, 1.4])   # D = 2
vec2
array([2.3, 1.4])
draw_vecs([vec2])

Operation 1: Vector Addition#

  1. Vectors are a “data structure”.

  2. The first operation we can perform on a vector is addition with another vector.

print("vec1", vec1)
print("vec2", vec2)
vec1 + vec2    # note the component-wise addition
vec1 [1. 2.]
vec2 [2.3 1.4]
array([3.3, 3.4])
draw_vecs([vec1, (vec1, vec2), vec1 + vec2])

Operation 2: Vector Scaling#

  1. The second operation we can perform on a vector is scaling with a number.

print(vec1)
.5*vec1
[1. 2.]
array([0.5, 1. ])
draw_vecs([vec1, .5*vec1])

Properties#

  1. Previously, we saw scaling and addition independently.

  2. How do scaling and addition work with each other?

# 1: Multiplying first then adding
draw_vecs([3.*vec1, (3.*vec1, 3.*vec2), 3.*vec1 + 3.*vec2])
# 2: Adding then multiplying
draw_vecs([3.*vec1, (3.*vec1, 3.*vec2), 3.*(vec1 + vec2)])

Summary#

  1. A vector is an array of numbers notated as

\[\begin{split} x = \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} \end{split}\]

where \(x_i\) indicates the \(i\)-th number in the vector.

Operations#

  1. Vector addition is defined element-wise as

\[\begin{split} \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} + \begin{pmatrix} y_1 \\ \vdots \\ y_n \end{pmatrix} = \begin{pmatrix} x_1 + y_1 \\ \vdots \\ x_n + y_n \end{pmatrix} \,. \end{split}\]

  2. Vector scaling is defined element-wise as

\[\begin{split} c \cdot \begin{pmatrix} x_1 \\ \vdots \\ x_n \end{pmatrix} = \begin{pmatrix} cx_1 \\ \vdots \\ cx_n \end{pmatrix} \,. \end{split}\]

Properties#

Using the definitions of addition and scaling, we can verify properties such as the following (a quick numerical check appears after the list):

  1. distributivity: \(c(x + y) = cx + cy\)

  2. associativity: \((x + y) + z= x + (y + z)\)

  3. commutativity: \(x + y = y + x\)
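
These properties are easy to spot-check numerically. Below is a quick check of all three on concrete vectors (the extra vector z is our own choice, and of course a few examples do not constitute a proof):

c = 3.
x, y, z = vec1, vec2, np.array([-1., 0.5])
print(np.allclose(c*(x + y), c*x + c*y))       # distributivity: True
print(np.allclose((x + y) + z, x + (y + z)))   # associativity: True
print(np.allclose(x + y, y + x))               # commutativity: True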

Aside: “Abstract Method”#

  1. In applied settings, we can largely work with vectors as arrays of numbers, since we eventually hope to compute with them.

  2. For theoretical purposes, we can think of vectors as any abstract set of elements that we can add and scale, so long as the rules above are satisfied (see the small illustration below).
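
As a small illustration of the abstract view (this example is ours, not part of the original notes): real-valued functions can be added and scaled pointwise, and these operations obey the same rules, so functions also behave like vectors.

# Functions as "vectors": addition and scaling are defined pointwise
def add(f, g):
    return lambda t: f(t) + g(t)

def scale(c, f):
    return lambda t: c * f(t)

h = add(scale(2., np.sin), np.cos)   # the "vector" 2*sin + cos
h(0.0)                               # 2*sin(0) + cos(0) = 1.0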

Dot Product and Orthogonality#

The dot product of two vectors \(x\) and \(y\) is defined as

\[ x \cdot y = \sum_{i} x_i y_i \,. \]
np.dot([1., 2.], [2., 3.])
np.float64(8.0)

Orthogonality#

Two vectors \(x\) and \(y\) are said to be orthogonal if \(x \cdot y = 0\).

# Orthogonal
x = np.array([1.0, 0.5])
y = np.array([-.5, 1.])
print(np.dot(x, y))
draw_vecs([x, y])
0.0
# Not Orthogonal
x = np.array([1.0, .5])
y = np.array([-.5, .6])
print(np.dot(x, y))
draw_vecs([x, y])
-0.2

Norm#

The Euclidean norm measures the length of a vector.

\[ \lVert x \rVert = \sqrt{\sum_{i=1}^n x_i^2} = \sqrt{x \cdot x} \]
np.linalg.norm(np.array([1., 2., 3.])), np.sqrt(1**2 + 2**2 + 3**2)
(np.float64(3.7416573867739413), np.float64(3.7416573867739413))

Dot Product and Angle#

Let \(\theta\) be the angle between \(x\) and \(y\). Then

\[ \cos(\theta) = \frac{x \cdot y}{\lVert x \rVert \lVert y \rVert} \,. \]

Recall that two vectors are orthogonal when \(\cos(\theta) = 0\), which means that \(\theta\) is an odd multiple of \(\pi/2\) so that they are at “right angles”.
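
As a sanity check, here is a small helper (our own, not from the notes) that recovers \(\theta\) from the formula above; perpendicular unit vectors come out at 90 degrees.

def angle_between(x, y):
    # cos(theta) = (x . y) / (||x|| ||y||); clip guards against rounding error
    cos_theta = np.dot(x, y) / (np.linalg.norm(x) * np.linalg.norm(y))
    return np.arccos(np.clip(cos_theta, -1.0, 1.0))

print(np.degrees(angle_between(np.array([1., 0.]), np.array([0., 1.]))))   # 90.0
print(np.degrees(angle_between(vec1, vec2)))   # angle between vec1 and vec2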

Complex Vectors#

ivec1 = np.array([1. + 1j, 2. - 3.1j])   # D = 2
ivec1
array([1.+1.j , 2.-3.1j])
ivec2 = np.array([- 1j, -2.])   # D = 2
ivec2
array([-0.-1.j, -2.+0.j])

Complex Vector Operations#

ivec1 + ivec2
array([1.+0.j , 0.-3.1j])
3j * ivec1
array([-3. +3.j,  9.3+6.j])

Complex Norm#

Let

\[\begin{split} \mathbf{z} = \begin{pmatrix} z_1 \\ \vdots \\ z_n \\ \end{pmatrix} \,. \end{split}\]

Then

\[ \lVert \mathbf{z} \rVert = \sqrt{\sum_{i=1}^n z_i \bar{z_i}} \]
np.linalg.norm(ivec2), np.sqrt(ivec2[0] * np.conjugate(ivec2[0]) + ivec2[1] * np.conjugate(ivec2[1]))
(np.float64(2.23606797749979), np.complex128(2.23606797749979+0j))

A Qubit is a 2D Complex Vector#

Recall that a qubit state was defined as a pair of complex numbers with unit norm

\[\begin{split} |q\rangle = \begin{pmatrix} a \\ b \end{pmatrix} \in \mathbb{C}^2 \quad \text{where} \quad \sqrt{a\bar{a} + b\bar{b}} = 1 \,. \end{split}\]

Thus a qubit can be equivalently defined as a unit-norm complex vector

\[ |q\rangle \in \{ \mathbf{z} \in \mathbb{C}^2 : \lVert \mathbf{z} \rVert = 1\} \,. \]
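
A minimal numerical sketch (the state below is our own example): the vector \(\frac{1}{\sqrt{2}}(1, i)\) has norm 1 and is therefore a valid qubit state.

q = np.array([1., 1j]) / np.sqrt(2)   # a candidate qubit state in C^2
np.linalg.norm(q)                     # 1.0, so q is a valid qubit state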

Matrices#

  1. Matrices are “functions” on vector data, i.e., they transform vectors into vectors.

  2. Whereas we might write a typical function as code, a matrix can be represented as a rectangular array of numbers.

Real Matrices#

Example: 2x2 Matrix#

\[\begin{split} \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix} \end{split}\]
np.array([[1, 2], [3, 4]])
array([[1, 2],
       [3, 4]])

Example: 2x3 Matrix#

\[\begin{split} \begin{pmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \end{pmatrix} \end{split}\]
np.array([[1, 2, 3], [4, 5, 6]])
array([[1, 2, 3],
       [4, 5, 6]])

Example: nxm Matrix#

\[\begin{split} \begin{pmatrix} a_{11} & \dots & a_{1m} \\ \vdots & \ddots & \vdots \\ a_{n1} & \dots & a_{nm} \end{pmatrix} \end{split}\]
n = 3
m = 10
A = np.zeros((n, m))
A[2, 3] = 1
A
array([[0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
       [0., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
       [0., 0., 0., 1., 0., 0., 0., 0., 0., 0.]])

Complex Matrices#

We can also have complex matrices

\[\begin{split} \begin{pmatrix} j & -2 \\ 0 & 3+2j \end{pmatrix} \end{split}\]
C = np.array([
    [1j, -2],
    [0, 3 + 2j]
])
C
array([[ 0.+1.j, -2.+0.j],
       [ 0.+0.j,  3.+2.j]])

Matrix Multiplication: View 1#

\[\begin{split} \begin{pmatrix} a_{11} & \dots & a_{1m} \\ \vdots & \ddots & \vdots \\ a_{n1} & \dots & a_{nm} \end{pmatrix} \begin{pmatrix} b_{11} & \dots & b_{1k} \\ \vdots & \ddots & \vdots \\ b_{m1} & \dots & b_{mk} \end{pmatrix} = \begin{pmatrix} \sum_{i=1}^m a_{1i}b_{i1} & \dots & \sum_{i=1}^m a_{1i}b_{ik} \\ \vdots & \ddots & \vdots \\ \sum_{i=1}^m a_{ni}b_{i1} & \dots & \sum_{i=1}^m a_{ni}b_{ik} \end{pmatrix} \end{split}\]
print(vec1)
np.array([[1, 2], [3, 4]]) @ vec1
[1. 2.]
array([ 5., 11.])
np.array([[1, 2], [3, 4]]) @ np.array([[1, 2], [3, 4]])
array([[ 7, 10],
       [15, 22]])

Matrix Multiplication: View 2#

Each column of the product is a linear combination of the columns of the left matrix, with the entries of the corresponding column of the right matrix as the coefficients.

\[\begin{split} \begin{pmatrix} a_{11} & \dots & a_{1m} \\ \vdots & \ddots & \vdots \\ a_{n1} & \dots & a_{nm} \end{pmatrix} \begin{pmatrix} b_{1j} \\ \vdots \\ b_{mj} \end{pmatrix} = \begin{pmatrix} b_{1j} \begin{pmatrix} a_{11} \\ \vdots \\ a_{n1} \end{pmatrix} + \dots + b_{mj} \begin{pmatrix} a_{1m} \\ \vdots \\ a_{nm} \end{pmatrix} \end{pmatrix} \end{split}\]
print(np.array([[1, 2], [3, 4]]) @ vec1)
print(np.array([1, 3]) * vec1[0] + np.array([2, 4]) * vec1[1])
[ 5. 11.]
[ 5. 11.]
print(np.array([[1, 2], [3, 4]]) @ np.array([[1, 2], [3, 4]]))
np.concatenate([(np.array([1, 3]) * 1 + np.array([2, 4]) * 3).reshape(2, -1),
                (np.array([1, 3]) * 2 + np.array([2, 4]) * 4).reshape(2, -1)], axis=1)
[[ 7 10]
 [15 22]]
array([[ 7, 10],
       [15, 22]])

Matrix Multiplication is Linear#

This means that

\[ A(c(x + y)) = cAx + cAy \]

for any matrix \(A\), constant \(c\), and vectors \(x\) and \(y\).

def rotation_matrix(angle):
    theta = angle * np.pi/180
    R = np.array([
        [np.cos(theta), -np.sin(theta)],
        [np.sin(theta), np.cos(theta)]
    ]) # 2x2 matrix
    return R

R = rotation_matrix(90)
R @ (2 * (vec1 + vec2)), 2 * R @ vec1 + 2 * R @ vec2
(array([-6.8,  6.6]), array([-6.8,  6.6]))

Matrix Multiplication can be Sequenced#

We can read the matrix multiplications

\[ B A x \]

as

  1. Apply \(A\) to \(x\)

  2. Then apply \(B\) to the result \(Ax\).

draw_vecs([vec1, R @ vec1 , R @ R @ vec1, R @ R @ R @ vec1, R @ R @ R @ R @ vec1])
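
The same point in matrix form (a check of our own): composing two 90-degree rotations is the same transformation as a single 180-degree rotation.

# Sequencing R(90) twice equals the single rotation R(180)
np.allclose(rotation_matrix(90) @ rotation_matrix(90), rotation_matrix(180))   # True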

Matrix Inverse#

\(A^{-1}\) is called the inverse of \(A\) if

\[ AA^{-1} = I = A^{-1}A \]

where \(I\) is an identity matrix, i.e., a matrix with 1s on the diagonal and 0s everywhere else.

  1. Not every matrix has an inverse! (See the example after this list.)

  2. Interpretation of the inverse: if \(A\) applies a linear transformation, then \(A^{-1}\) undoes that transformation, since \(A^{-1}A\) is the identity, i.e., the linear transformation that does nothing.
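
Here is a quick demonstration (our own example) of a matrix with no inverse: its columns are linearly dependent, so np.linalg.inv raises an error.

# A singular matrix: the second column is twice the first
S = np.array([[1., 2.],
              [2., 4.]])
try:
    np.linalg.inv(S)
except np.linalg.LinAlgError as err:
    print("No inverse:", err)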

R30 = rotation_matrix(30)
R30
array([[ 0.8660254, -0.5      ],
       [ 0.5      ,  0.8660254]])
np.linalg.inv(R30), np.linalg.inv(R30) @ R30
(array([[ 0.8660254,  0.5      ],
        [-0.5      ,  0.8660254]]),
 array([[1.00000000e+00, 7.43708407e-18],
        [6.29482353e-17, 1.00000000e+00]]))
# Inverse of rotation matrix is rotation in other direction
np.linalg.inv(R30), rotation_matrix(-30)
(array([[ 0.8660254,  0.5      ],
        [-0.5      ,  0.8660254]]),
 array([[ 0.8660254,  0.5      ],
        [-0.5      ,  0.8660254]]))
draw_vecs([vec1, R30 @ vec1, np.linalg.inv(R30) @ vec1])

Quantum gates on single-qubit systems are 2x2 Unitary Matrices#

The unitary matrix

\[\begin{split} H = \frac{1}{\sqrt{2}} \begin{pmatrix} 1 & 1 \\ 1 & -1 \\ \end{pmatrix} \end{split}\]

encodes the Hadamard gate. We’ll characterize unitary matrices later.
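
Although the full characterization comes later, one defining property of a unitary matrix \(U\) is that its conjugate transpose is its inverse, i.e., \(U^\dagger U = I\). Here is a quick numpy check on our own construction of \(H\):

H = np.array([[1., 1.],
              [1., -1.]]) / np.sqrt(2)
np.allclose(H.conj().T @ H, np.eye(2))   # True: H is unitary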

qc_H = QuantumCircuit(1)
qc_H.h(0)
qc_H.draw()
   ┌───┐
q: ┤ H ├
   └───┘
# This produces the unitary matrix
Operator(qc_H)
Operator([[ 0.70710678+0.j,  0.70710678+0.j],
          [ 0.70710678+0.j, -0.70710678+0.j]],
         input_dims=(2,), output_dims=(2,))

Application of a gate to a qubit is matrix multiplication#

(zero.evolve(Operator(qc_H))).draw('latex')
\[\frac{\sqrt{2}}{2} |0\rangle+\frac{\sqrt{2}}{2} |1\rangle\]
np.array(Operator(qc_H)) @ np.array(zero)
array([0.70710678+0.j, 0.70710678+0.j])
(one.evolve(Operator(qc_H))).draw('latex')
\[\frac{\sqrt{2}}{2} |0\rangle- \frac{\sqrt{2}}{2} |1\rangle\]
np.array(Operator(qc_H)) @ np.array(one)
array([ 0.70710678+0.j, -0.70710678+0.j])

Sequencing gates corresponds to matrix multiplication#

qc_H2 = QuantumCircuit(1)
qc_H2.h(0)
qc_H2.h(0)
qc_H2.draw()
   ┌───┐┌───┐
q: ┤ H ├┤ H ├
   └───┘└───┘
Operator(qc_H2)
Operator([[1.+0.j, 0.+0.j],
          [0.+0.j, 1.+0.j]],
         input_dims=(2,), output_dims=(2,))
Operator(qc_H) @ Operator(qc_H)
Operator([[ 1.00000000e+00+0.j, -2.23711432e-17+0.j],
          [-2.23711432e-17+0.j,  1.00000000e+00+0.j]],
         input_dims=(2,), output_dims=(2,))
np.allclose(Operator(qc_H2), Operator(qc_H) @ Operator(qc_H))
True

Unitary matrix properties#

Unitary matrices have two properties that are important for quantum computing.

  1. Every unitary matrix is invertible. This coincides with our intuition in the single-qubit case that every quantum operation on the Bloch sphere should be reversible.

  2. Every unitary matrix preserves the norm of the input vector. In symbols,

\[ \lVert U x \rVert = \lVert x \rVert \,. \]

This means that applying a quantum gate to a single qubit produces an output that is also a qubit.

# This demonstrates that H is its own inverse
Operator(qc_H) @ Operator(qc_H)
Operator([[ 1.00000000e+00+0.j, -2.23711432e-17+0.j],
          [-2.23711432e-17+0.j,  1.00000000e+00+0.j]],
         input_dims=(2,), output_dims=(2,))
# This demonstrates that H preserves the norm
np.linalg.norm(zero), np.linalg.norm(np.array(Operator(qc_H)) @ np.array(zero))
(np.float64(1.0), np.float64(0.9999999999999999))

Summary#

  1. We had a crash course on linear algebra today and tied it back to the single-qubit case.

  2. These concepts will be used throughout the course.

  3. Next time, we will use linear algebra to begin to talk about multi-qubit systems.