iseff.com · TEXXR

/

Navigation

C

Chronicles

Browse all articles

C

E

Explore

Semantic exploration

E

R

Research

Entity momentum

R

N

Nexus

Correlations & relationships

N

~

Story Arc

Topic evolution

S

↻

Drift Map

Semantic trajectory animation

D

P

Posts

Analysis & commentary

P

Browse

@

Entities

Companies, people, products, technologies

◇

Domains

Browse by publication source

☉

Handles

Browse by social media handle

Detection

?

Concept Search

Semantic similarity search

!

High Impact Stories

Top coverage by position

+

Sentiment Analysis

Positive/negative coverage

*

Anomaly Detection

Unusual coverage patterns

Analysis

vs

Rivalry Report

Compare two entities head-to-head

/\

Semantic Pivots

Narrative discontinuities

!!

Crisis Response

Event recovery patterns

Connected

Nav: C E R N

Search: /

Command: ⌘K

Embeddings: large

2024-12-26

Interesting to think about the difficulty in creating *good* evals. [embedded post]

2024-12-26 View on X

Time

A look at the more challenging AI evaluations emerging in response to the rapid progress of models, including FrontierMath, Humanity's Last Exam, and RE-Bench

more interesting than it sounds! LinkedIn: Ross Dawson : The frontier of “evals”. Evaluations comparing AI ahd human capabilities are evolving rapidly as AI rapidly leaves existin...

View original