Skip to content
View tommasocerruti's full-sized avatar
🔒
locked-in
🔒
locked-in

Highlights

  • Pro

Organizations

@evaleval

Block or report tommasocerruti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
tommasocerruti/README.md

Pinned Loading

  1. detllm detllm Public

    Deterministic-mode checks for LLM inference: measure run/batch variance, generate repro packs, and explain why outputs differ.

    Python 18 1

  2. rowlang rowlang Public

    RowLang is a minimalistic esoteric programming language written as an analogy to rowing.

    C 20 1

  3. algolab-2024 algolab-2024 Public

    Algorithms Lab (Algolab) HS 2024 @ ETH Zurich, solutions and insights

    C++ 16 3

  4. tiny-word2vec tiny-word2vec Public

    Numpy implementation of Word2Vec using SGNS

    Python

  5. evaleval/every_eval_ever evaleval/every_eval_ever Public

    Every Eval Ever is a shared schema and crowdsourced eval database. It defines a standardized metadata format for storing AI evaluation results — from leaderboard scrapes and research papers to loca…

    Python 52 29

  6. cocoabench/cocoa-agent cocoabench/cocoa-agent Public

    An agent framework for building and evaluating general digital agents.

    Python 28 16