A postmortem of HyperWrite's Reflection 70B model blames "a bug in the initial code for benchmarking", after evaluators couldn't reproduce some claimed results (Carl Franzen/VentureBeat)
Carl Franzen / VentureBeat: A postmortem of HyperWrite’s Reflection 70B model blames “a bug in the initial code for benchmarking”,…