Numpy's Universal Functions
Before defining what
ufunc
are, we need to know why we need ufunc
at first place. In Python, for
loops are used to run operation on each item in the list. However, for
loops are very slow as it runs on each item in the list, one-by-one.On the other hand, we can make this iteration process faster by applying vectorized operations using NumPy universal functions
ufunc
, whose main purpose is to quickly execute repeated operations on values in NumPy arrays.Vectorization is the process of converting an algorithm from operating on a single value at a time to operating on a set of values at one time.
In the following examples, we will work with basic arithmetic operations like addition, subtraction, multiplication, division, square etc, on the NumPy array
import numpy as np
# creating an array first, using arange
arr = np.arange(1,11)
print(f"Array on which to perform ufunc: \n{arr}")
# addition
print(f"array + 1 is:\n{arr+1}")
# subtration
print(f"array - 1 is:\n{arr-1}")
# multiplication
print(f"array * 5 is:\n{arr*5}")
# division
print(f"array / 2 is:\n{arr/5}")
# negative * array
print(f"-array is:\n{-arr}")
# square
print(f"array sqaure is:\n{arr**2}")
# modulus operator
print(f"array % is:\n{arr%2}")
Array on which to perform ufunc:
[ 1 2 3 4 5 6 7 8 9 10]
array + 1 is:
[ 2 3 4 5 6 7 8 9 10 11]
array - 1 is:
[0 1 2 3 4 5 6 7 8 9]
array * 5 is:
[ 5 10 15 20 25 30 35 40 45 50]
array / 2 is:
[0.2 0.4 0.6 0.8 1. 1.2 1.4 1.6 1.8 2. ]
-array is:
[ -1 -2 -3 -4 -5 -6 -7 -8 -9 -10]
array sqaure is:
[ 1 4 9 16 25 36 49 64 81 100]
array % is:
[1 0 1 0 1 0 1 0 1 0]
We can use
np.abs
function to find the absolute value of NumPy array elements;absolute = np.array([-2,-1,0,1,2])
np.absolute(absolute)
array([2, 1, 0, 1, 2])
These trigonometric functions work on radians, so if we have an array of angles, we first need to convert it to radians by using
np.deg2rad(angles)
function. Once we have all array value in radians, only then we can call trigonometric functions.# creating array of angles
angles = np.array([0,30,45,60,90])
print(f"We have an array of following angles:\n{angles}")
rad = np.deg2rad(angles)
print(f"After converting degree into radian:\n{rad}")
# finding, sin, cos and tan of these angles, through radians
sin = np.sin(rad)
print(f"Sine of angles:\n{sin}")
cos = np.cos(rad)
print(f"Cosine of angles:\n{cos}")
tan = np.tan(rad)
print(f"Tangent of angles:\n{tan}")
We have an array of following angles:
[ 0 30 45 60 90]
After converting degree into radian:
[0. 0.52359878 0.78539816 1.04719755 1.57079633]
Sine of angles:
[0. 0.5 0.70710678 0.8660254 1. ]
Cosine of angles:
[1.00000000e+00 8.66025404e-01 7.07106781e-01 5.00000000e-01
6.12323400e-17]
Tangent of angles:
[0.00000000e+00 5.77350269e-01 1.00000000e+00 1.73205081e+00
1.63312394e+16]
Now, we will demonstrate the use of
np.arcsin
, np.arcos
, and np.arctan
functions to return the trigonometric inverse of sin
, cos
, and tan
values of angle in the arrayangle_sin = np.rad2deg(np.arcsin(sin))
print(f"We got back the angles, from arcsin and rad2degree:\n{angle_sin}")
angle_cos = np.rad2deg(np.arccos(cos))
print(f"We got back the angles, from arccos and rad2degree:\n{angle_cos}")
angle_tan = np.rad2deg(np.arctan(tan))
print(f"We got back the angles, from arctan and rad2degree:\n{angle_tan}")
We got back the angles, from arcsin and rad2degree:
[ 0. 30. 45. 60. 90.]
We got back the angles, from arccos and rad2degree:
[ 0. 30. 45. 60. 90.]
We got back the angles, from arctan and rad2degree:
[ 0. 30. 45. 60. 90.]
All of the following statistical functions can be applied, along axis of your choice using optional
axes
kwarg# creating array of random numbers
rand = np.random.RandomState(42)
ar_s = rand.randint(1000, size=(100,100))
print(f"Min value in the array: {np.min(ar_s)}")
print(f"Max value in the array: {np.max(ar_s)}")
print(f"Sum of all values in the array: {np.sum(ar_s)}")
print(f"Range of values, max-min, in the array: {np.ptp(ar_s)}")
print(f"70th percentile in the array: {np.percentile(ar_s, 75)}")
print(f"Mean value of the array: {np.mean(ar_s)}")
print(f"Median value of the array: {np.median(ar_s)}")
print(f"Standard Deviation of the array: {np.std(ar_s)}")
print(f"Variance of the array: {np.var(ar_s)}")
Min value in the array: 0
Max value in the array: 999
Sum of all values in the array: 5034706
Range of values, max-min, in the array: 999
70th percentile in the array: 757.0
Mean value of the array: 503.4706
Median value of the array: 505.5
Standard Deviation of the array: 289.70994725007284
Variance of the array: 83931.85353564
Let’s see how to calculate exponential of elements in a Numpy array:
# array
arr_el = np.array([1,2,3,4,5])
print(f"Array, we are working with: {arr_el}")
print(f"Exponent: {np.exp(arr_el)}")
print(f"E^2: {np.exp2(arr_el)}")
print(f"E^10: {np.power(10, arr_el)}")
Array, we are working with: [1 2 3 4 5]
Exponent: [ 2.71828183 7.3890561 20.08553692 54.59815003 148.4131591 ]
E^2: [ 2. 4. 8. 16. 32.]
E^10: [ 10 100 1000 10000 100000]
Now, let’s take log of each of these values -natural log, log with base 2 and 10. As we are applying these log on their corresponding exponent values, we will get our original array back in the output.
print(f"Natural Log, on values from E^: {np.log(np.exp(arr_el))}")
print(f"Log with base 2, on values from E^2: {np.log2(np.exp2(arr_el))}")
print(f"Log with base 10, on values from E^10: {np.log10(np.power(10, arr_el))}")
Natural Log, on values from E^: [1. 2. 3. 4. 5.]
Log with base 2, on values from E^2: [1. 2. 3. 4. 5.]
Log with base 10, on values from E^10: [1. 2. 3. 4. 5.]
We can specify the output location, to save the array
input_arr = np.arange(1,6)
output_arr = np.empty(5)
# using np.multiply function and saving output in empty array
np.multiply(input_arr, 10, out=output_arr)
array([10., 20., 30., 40., 50.])
In this section, we will discuss
.reduce
method which repeatedly applies a given operation (let suppose, add or multiply) to all the elements of an array and give the final result. What? Keep reading to understand reduce
method and how it differs from its conceptual equivalents.For taking sum, both
np.add.reduce
and np.sum
produces the same result (for 1D array) but former is faster than later. The same logic goes for the results of np.multiply.reduce
and its equivalent np.prod
# creating array
r = np.arange(1,11)
print(f"1D array to start with:\n{r}")
print(f"Adding all elements and reduce: \n{np.add.reduce(r)}")
print(f"Same result as np.sum: \n{np.sum(r)}")
print(f"Multiply all elements and reduce: \n{np.multiply.reduce(r)}")
print(f"Same result as np.prod: \n{np.prod(r)}")
Array to start with:
[ 1 2 3 4 5 6 7 8 9 10]
Adding all elements and reduce:
55
Same result as np.sum:
55
Multiply all elements and reduce:
3628800
Same result as np.prod:
3628800
However, for 2D array, these above mentioned equivalence isn’t there, in terms of final results.
# creating a 2D array and see how the result differs
r2d = np.arange(1,7).reshape(2,3)
print(f"2D array to start with:\n{r2d}")
print(f"Adding all elements and reduce: \n{np.add.reduce(r2d)}")
print(f"Result as np.sum: \n{np.sum(r2d)}")
print(f"Multiply all elements and reduce: \n{np.multiply.reduce(r2d)}")
print(f"Result as np.prod: \n{np.prod(r2d)}")
2D array to start with:
[[1 2 3]
[4 5 6]]
Adding all elements and reduce:
[5 7 9]
Result as np.sum:
21
Multiply all elements and reduce:
[ 4 10 18]
Result as np.prod:
720
Besides, we can use the optional
axis
keyword argument for all of functions above to perform the operation on a given axis. For .reduce
method, the default value of axis=0
np.accumulate
function perform a given operation (let suppose, add or multiply) and passes its accumulated resulted to the next element in the array. Therefore, we can see how, we reached at the final result in the array.However, for 1D arrays,
np.add.accumulate
and np.multiply.accumulate
are logical equivalents to np.cumsum
and np.cumprod
print(f"1D array to start with:\n{r}")
# add and accumulate
print(f"Add and accumulate:\n{np.add.accumulate(r)}")
print(f"Same result as np.cumsum: \n{np.cumsum(r)}")
# multiply and accumulate
print(f"Multiply and accumulate:\n{np.multiply.accumulate(r)}")
print(f"Same result as np.cumprod: \n{np.cumprod(r)}")
1D array to start with:
[ 1 2 3 4 5 6 7 8 9 10]
Add and accumulate:
[ 1 3 6 10 15 21 28 36 45 55]
Same result as np.cumsum:
[ 1 3 6 10 15 21 28 36 45 55]
Multiply and accumulate:
[ 1 2 6 24 120 720 5040 40320 362880
3628800]
Same result as np.cumprod:
[ 1 2 6 24 120 720 5040 40320 362880
3628800]
However, for 2D array, these above mentioned equivalence isn’t there
print(f"2D array to start with:\n{r2d}")
# add and accumulate
print(f"Add and accumulate:\n{np.add.accumulate(r2d)}")
print(f"Result as np.cumsum: \n{np.cumsum(r2d)}")
# multiply and accumulate
print(f"Multiply and accumulate:\n{np.multiply.accumulate(r2d)}")
print(f"Result as np.cumprod: \n{np.cumprod(r2d)}")
2D array to start with:
[[1 2 3]
[4 5 6]]
Add and accumulate:
[[1 2 3]
[5 7 9]]
Result as np.cumsum:
[ 1 3 6 10 15 21]
Multiply and accumulate:
[[ 1 2 3]
[ 4 10 18]]
Result as np.cumprod:
[ 1 2 6 24 120 720]
Besides, we can use the optional
axis
keyword argument for all of functions above to perform the operation on a given axis. For .reduce
method, the default value of axis=0
.outer
method can be applied to any ufunc
to compute the output of all pairs of two different inputs. Examples, will make the concept clear:a1 = np.arange(4)
print(f"Array to work on:\n{a1}")
print(f"add.outer on array:\n{np.add.outer(a1,a1)}")
print(f"multiply.outer on array:\n{np.multiply.outer(a1,a1)}")
Array to work on:
[0 1 2 3]
add.outer on array:
[[0 1 2 3]
[1 2 3 4]
[2 3 4 5]
[3 4 5 6]]
multiply.outer on array:
[[0 0 0 0]
[0 1 2 3]
[0 2 4 6]
[0 3 6 9]]
For example,
np.add.outer(a1,a1)
has added each element of a1
, on every element in the row, once in each row:- first row is
[0,1,2,3]
which is0
added to every element in row - second rows is
[1,2,3,4]
which is1
added to every element in the row - third rows is
[2,3,4,5]
which is2
added to every element in the row - fourth rows is
[3,4,5,6]
which is3
added to every element in the row
Last modified 6mo ago