Building an Agentic Researcher at StackOne
How I built an AI agent that finds agentic use-cases and gets better with feedback.
How I built an AI agent that finds agentic use-cases and gets better with feedback.
Here is how I optimized the serving of my fine-tuned LLM to efficiently evaluate it on the Tau benchmark.
How to make your GPU utilisation go up by taking your GPU Memory usage How to make your GPU train an LLM fast(er)
How I spent the last day of my Dubrovnik holiday fixing my broken FPL bot while dodging lightning bolts and BBQ duties.
How I built a Fantasy Premier League MCP Server to automate my terrible FPL career.
How I built an AI agent that translates provider errors into clear resolution steps, starting with evals from day one.
The training partition is a crucial component of an evaluation. It is used to assess generalisation performance on unseen data and ensure that the model is not overfitting to the training data. Here is a (probably) new way of partitioning time-series data for training and testing.