How can Euclidean distance be implemented in Python?

by EITCA Academy / Monday, 07 August 2023 / Published in Artificial Intelligence, EITC/AI/MLP Machine Learning with Python, Programming machine learning, Euclidean distance, Examination review

Euclidean distance is a fundamental concept in machine learning and is widely used in various algorithms such as k-nearest neighbors, clustering, and dimensionality reduction. It measures the straight-line distance between two points in a multidimensional space. In Python, implementing Euclidean distance is relatively straightforward and can be done using basic mathematical operations.

To calculate the Euclidean distance between two points, we need to follow these steps:

1. Define the two points: Let's say we have two points, A and B, in a d-dimensional space. Each point can be represented as a list or a numpy array containing the coordinates in each dimension.

2. Calculate the squared differences: For each dimension, calculate the squared difference between the coordinates of the two points. This can be done using a loop or by utilizing vectorized operations if using numpy arrays.

3. Sum the squared differences: Sum up the squared differences calculated in the previous step for all dimensions. This will give us the sum of squared differences.

4. Take the square root: Finally, take the square root of the sum of squared differences to obtain the Euclidean distance between the two points.

Here's a Python function that implements the Euclidean distance calculation:

python
import numpy as np

def euclidean_distance(pointA, pointB):
    # Convert the points to numpy arrays if they are not already
    pointA = np.array(pointA)
    pointB = np.array(pointB)
    
    # Calculate the squared differences for each dimension
    squared_diff = (pointA - pointB) ** 2
    
    # Sum up the squared differences
    sum_squared_diff = np.sum(squared_diff)
    
    # Take the square root
    distance = np.sqrt(sum_squared_diff)
    
    return distance

Let's use this function to calculate the Euclidean distance between two points:

python
point1 = [1, 2, 3]
point2 = [4, 5, 6]

distance = euclidean_distance(point1, point2)
print(distance)

Output:

5.196152422706632

In the above example, we have two points, `point1` and `point2`, represented as lists. The Euclidean distance between them is calculated using the `euclidean_distance` function, and the result is printed.

This implementation can be extended to work with points in any number of dimensions. It is also possible to optimize the implementation further by utilizing libraries such as scipy, which provide efficient implementations of distance calculations.

Calculating the Euclidean distance in Python involves calculating the squared differences between the coordinates of two points, summing up these squared differences, and taking the square root of the sum. The provided implementation is a basic example that can be extended and optimized based on specific requirements.

EITCA Academy

How can Euclidean distance be implemented in Python?

Other recent questions and answers regarding EITC/AI/MLP Machine Learning with Python:

More questions and answers:

EITCA Academy is a part of the European IT Certification framework

EITCA Academy

SIGN IN YOUR ACCOUNT TO HAVE ACCESS TO DIFFERENT FEATURES

FORGOT YOUR DETAILS?

CREATE ACCOUNT

How can Euclidean distance be implemented in Python?

Other recent questions and answers regarding EITC/AI/MLP Machine Learning with Python:

More questions and answers: