Day 5: Natural Selection in Software: Implementing Fitness Functions

In the natural world, organisms survive and reproduce based on their ability to adapt to their environment. This principle of natural selection is central to the effectiveness of genetic algorithms. In software, our equivalent of survival is fitness, a quantitative measure of how well a solution performs.

Today, we focus on the role of fitness functions in guiding evolutionary progress in a genetic algorithm, and we’ll implement them in C# to evaluate and score candidate solutions effectively.

What Is a Fitness Function?

A fitness function evaluates how “good” a chromosome is relative to the goal of your problem. It provides the only feedback loop the algorithm uses to determine which solutions to preserve, recombine, or discard. In other words, fitness functions define the environment in which natural selection occurs.

In C# terms, the fitness function is typically implemented as a method that returns an int or double, with higher values indicating better solutions (though some problems invert this).

Key Characteristics of a Good Fitness Function

Objective: Clearly aligned with the problem’s goals.
Gradual: Provides continuous feedback rather than binary success/failure.
Discriminative: Can meaningfully differentiate between competing solutions.
Efficient: Should execute quickly, since it’s called repeatedly across generations.

Case Study: Matching a Target Phrase

Let’s build a fitness function for the “evolve a string” example, where the goal is to transform a random string into a target like "HELLO WORLD".

public int GetFitness(string target)
{
    int score = 0;
    for (int i = 0; i < Genes.Length; i++)
    {
        if (Genes[i] == target[i])
        {
            score++;
        }
    }
    return score;
}

This function assigns one point for every character that matches the corresponding character in the target string. A perfect match receives the highest fitness.

Using Levenshtein Distance (Advanced)

For more robust fitness scoring in string problems, you could use Levenshtein distance, which measures how many edits are needed to transform one string into another.

public int LevenshteinDistance(string a, string b)
{
    var costs = new int[b.Length + 1];
    for (int j = 0; j <= b.Length; j++)
        costs[j] = j;

    for (int i = 1; i <= a.Length; i++)
    {
        costs[0] = i;
        int prevCost = i - 1;
        for (int j = 1; j <= b.Length; j++)
        {
            int currentCost = costs[j];
            costs[j] = a[i - 1] == b[j - 1] ? prevCost : 1 + Math.Min(Math.Min(costs[j - 1], costs[j]), prevCost);
            prevCost = currentCost;
        }
    }
    return costs[b.Length];
}

Then fitness can be derived as:

public int GetFitness(string target)
{
    int distance = LevenshteinDistance(GetPhrase(), target);
    return target.Length - distance; // Higher is better
}

This gives a more nuanced view of closeness than character-by-character comparison.

Normalizing Fitness

In some domains, especially numeric or continuous spaces, it may be useful to normalize fitness values to a 0 to 1 scale. This helps with selection mechanisms like roulette-wheel or proportional selection.

public double NormalizeFitness(int fitness, int maxPossible)
{
    return (double)fitness / maxPossible;
}

Fitness in Other Domains

Route Optimization

For problems like the Traveling Salesperson Problem (TSP), fitness could be the inverse of total distance:

public double GetFitness(List<int> route, double[,] distanceMatrix)
{
    double totalDistance = 0;
    for (int i = 0; i < route.Count - 1; i++)
    {
        totalDistance += distanceMatrix[route[i], route[i + 1]];
    }

    return 1 / totalDistance; // Shorter distance = higher fitness
}

Scheduling

In a scheduling application, fitness might depend on the number of conflicts avoided, resource utilization, or total task completion time.

Fitness Drives Evolution

The fitness function is the cornerstone of evolutionary pressure. Without it, there is no basis for selection or improvement. Every other part of the genetic algorithm, selection, crossover, and mutation, depends on how well you define and compute fitness.

Design your fitness function as carefully as you would a scoring system in a game or an evaluation rubric in a grading tool. It defines what “better” means in your algorithm.

Up Next

Tomorrow we’ll explore how to select which chromosomes get to breed and which ones get removed from the population. Selection strategies determine the speed and direction of evolution, and we’ll look at techniques like tournament selection, roulette wheel, and elitism.

Until then, take a look at your own code and ask: how would I score success?