
feat: add confusion matrix with precision, recall, and F1 score #14318

Open
Sagargupta16 wants to merge 2 commits into TheAlgorithms:master from Sagargupta16:add-confusion-matrix

Conversation

@Sagargupta16

Describe your change:

Added classification evaluation metrics to machine_learning/:

  • confusion_matrix: Binary and multiclass support
  • precision: TP / (TP + FP)
  • recall: TP / (TP + FN)
  • f1_score: Harmonic mean of precision and recall

The existing scoring_functions.py only has regression metrics (MAE, MSE, RMSE). Classification metrics were missing.
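The formulas above (TP / (TP + FP), TP / (TP + FN), and the harmonic mean) can be sketched for the binary case as follows. This is a hypothetical illustration of how the metrics relate, not code from the PR; `binary_metrics` is an invented helper name:

```python
def binary_metrics(actual: list[int], predicted: list[int]) -> dict[str, float]:
    """Sketch: precision, recall, and F1 for binary labels where 1 is positive."""
    tp = sum(a == 1 and p == 1 for a, p in zip(actual, predicted))
    fp = sum(a == 0 and p == 1 for a, p in zip(actual, predicted))
    fn = sum(a == 1 and p == 0 for a, p in zip(actual, predicted))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}
```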

  • Add an algorithm?
  • Fix a bug or typo in an existing algorithm?
  • Add or change documentation?
  • An existing implementation is improved

Checklist:

  • I have read CONTRIBUTING.md.
  • This pull request is all my own work -- I have not plagiarized.
  • I know that pull requests will not be merged if they fail the automated tests.
  • This PR only changes one algorithm file. To ease review, please open separate PRs for separate algorithms.
  • All new Python files are placed inside an existing directory.
  • All filenames are in all lowercase characters with no spaces or dashes.
  • All functions and variable names follow Python naming conventions.
  • All function parameters and return values are annotated with Python type hints.
  • All functions have doctests that pass the automated testing.
  • All new algorithms include at least one URL that points to Wikipedia or another similar explanation.
  • If this pull request is for a pre-existing algorithm, I have linked to the issue.

Sagargupta16 and others added 2 commits March 2, 2026 07:04
Add classification evaluation metrics:
- confusion_matrix: binary and multiclass support
- precision: TP / (TP + FP)
- recall (sensitivity): TP / (TP + FN)
- f1_score: harmonic mean of precision and recall

All functions include doctests.
Copilot AI review requested due to automatic review settings March 2, 2026 01:46
@algorithms-keeper algorithms-keeper bot added the awaiting reviews This PR is ready to be reviewed label Mar 2, 2026
Contributor

Copilot AI left a comment


Pull request overview

Adds core classification evaluation utilities to machine_learning/, complementing the existing regression-focused metrics by providing confusion-matrix-based scoring.

Changes:

  • Introduces a confusion_matrix() implementation supporting binary and multiclass labels.
  • Adds binary/one-vs-rest precision(), recall(), and f1_score() metrics (via positive_label).
  • Includes doctest examples and a __main__ doctest runner for the new module.


Comment on lines +42 to +44
    matrix = np.zeros((n, n), dtype=int)
    for a, p in zip(actual, predicted):
        matrix[class_to_index[a]][class_to_index[p]] += 1

Copilot AI Mar 2, 2026


zip(actual, predicted) will silently drop extra items when the input lists have different lengths, producing an incorrect confusion matrix without any error. Add an explicit length check up-front (and raise ValueError) so mismatched inputs fail fast (similar to other ML metric functions in this repo).
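A minimal sketch of the suggested guard. Only the three quoted lines and the names `n`, `class_to_index` come from the diff; the rest of the function body is reconstructed here as an assumption about what the surrounding code looks like:

```python
import numpy as np


def confusion_matrix(actual: list, predicted: list) -> np.ndarray:
    """Sketch: confusion matrix with an explicit fail-fast length check."""
    if len(actual) != len(predicted):
        raise ValueError(
            f"actual and predicted must have the same length: "
            f"{len(actual)} != {len(predicted)}"
        )
    # Map each observed class label to a row/column index.
    classes = sorted(set(actual) | set(predicted))
    class_to_index = {label: i for i, label in enumerate(classes)}
    n = len(classes)
    matrix = np.zeros((n, n), dtype=int)
    for a, p in zip(actual, predicted):
        matrix[class_to_index[a]][class_to_index[p]] += 1
    return matrix
```

With the guard in place, mismatched inputs raise immediately instead of producing a truncated (and wrong) matrix.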

Comment on lines +72 to +82
    tp = sum(
        1
        for a, p in zip(actual, predicted)
        if a == positive_label and p == positive_label
    )
    fp = sum(
        1
        for a, p in zip(actual, predicted)
        if a != positive_label and p == positive_label
    )
    return tp / (tp + fp) if (tp + fp) > 0 else 0.0

Copilot AI Mar 2, 2026


precision() iterates with zip(actual, predicted), so if the two inputs differ in length the computation is silently truncated. Consider validating equal lengths (and raising ValueError) before computing TP/FP so callers can’t get an incorrect metric without noticing.

Comment on lines +108 to +118
    tp = sum(
        1
        for a, p in zip(actual, predicted)
        if a == positive_label and p == positive_label
    )
    fn = sum(
        1
        for a, p in zip(actual, predicted)
        if a == positive_label and p != positive_label
    )
    return tp / (tp + fn) if (tp + fn) > 0 else 0.0

Copilot AI Mar 2, 2026


recall() has the same silent-truncation issue as precision() due to zip(actual, predicted). Add an explicit length check (raise ValueError) before computing TP/FN.

    return matrix


def precision(actual: list, predicted: list, positive_label: int = 1) -> float:

Copilot AI Mar 2, 2026


The positive_label: int = 1 type hint is overly restrictive: class labels are often strings or other hashable types, and precision/recall/f1_score work as long as positive_label is comparable to items in actual/predicted. Consider loosening the annotation (e.g., a TypeVar/Hashable) to avoid misleading API contracts and type checker errors.
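A sketch of the loosened annotation the comment suggests, using `collections.abc.Hashable`. This is an illustration of the suggestion, not the PR's actual signature; the function body is reconstructed from the quoted diff:

```python
from collections.abc import Hashable


def precision(
    actual: list[Hashable],
    predicted: list[Hashable],
    positive_label: Hashable = 1,
) -> float:
    """Sketch: precision that accepts int, str, or any hashable label type."""
    tp = sum(
        a == positive_label and p == positive_label
        for a, p in zip(actual, predicted)
    )
    fp = sum(
        a != positive_label and p == positive_label
        for a, p in zip(actual, predicted)
    )
    return tp / (tp + fp) if tp + fp else 0.0
```

String labels now type-check cleanly, e.g. `precision(y_true, y_pred, positive_label="spam")`.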

Comment on lines +49 to +57
def precision(actual: list, predicted: list, positive_label: int = 1) -> float:
    """
    Calculate precision: TP / (TP + FP).

    Args:
        actual: List of actual class labels.
        predicted: List of predicted class labels.
        positive_label: The label considered as positive class.

Copilot AI Mar 2, 2026


precision, recall, and f1_score are implemented as binary (or one-vs-rest via positive_label) metrics, but the docstrings don’t state this and could be interpreted as multiclass-averaged metrics. Clarify the behavior in the docstrings (and optionally add a doctest example showing one-vs-rest usage for a multiclass label set).
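One way the suggested doctest could look. This is a hypothetical sketch: the docstring wording and the multiclass example are invented here, and the body is reconstructed from the quoted diff rather than copied from the PR:

```python
def precision(actual: list, predicted: list, positive_label: int = 1) -> float:
    """
    One-vs-rest precision for positive_label: TP / (TP + FP).

    This is a binary metric; for multiclass labels, every label other than
    positive_label is treated as negative.

    >>> actual = [0, 1, 2, 2, 1]
    >>> predicted = [0, 2, 2, 2, 1]
    >>> precision(actual, predicted, positive_label=2)
    0.6666666666666666
    """
    tp = sum(
        a == positive_label and p == positive_label
        for a, p in zip(actual, predicted)
    )
    fp = sum(
        a != positive_label and p == positive_label
        for a, p in zip(actual, predicted)
    )
    return tp / (tp + fp) if tp + fp else 0.0
```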

@algorithms-keeper algorithms-keeper bot added the tests are failing Do not merge until tests pass label Mar 2, 2026
@Sagargupta16
Author

@copilot open a new pull request to apply changes based on the comments in this thread
