Profile avatar
sadiq.toao.com
Researcher @ Cambridge CL, OCaml hacker, fmr CEO at Opsian
5 posts 203 followers 44 following
Getting Started

New paper out today on how the careful design of LLMs is crucial for expert-level evidence retrieval in conservation (but with implications for any evidence synthesis pipeline across other fields) 🌍 doi.org/10.1371/jour... and anil.recoil.org/news/2024-ce... for a summary

Just how good are locally hostable code models on Cambridge first year OCaml assignments? @anil.recoil.org , @jon.recoil.org and I wanted to find out, so ran some tests. TL;DR Qwen3 means we might need new assignments. toao.com/blog/ocaml-l...

If you are using llama.cpp, here's a workaround using grammars for getting JSON structured output from Deepseek R1 and distills: toao.com/blog/json-ou...

Part of our @ai.cam.ac.uk project on AI in Conservation was published in TREE today. We gathered conservation scientists and AI experts and looked at the key conservation areas AI could revolutionise: www.cell.com/trends/ecolo...

Working to surface challenges faced by folks at the coal face. Data in research contributions from @orbenamy.bsky.social @sadiq.toao.com @scotthosking.bsky.social Stefan Scholtes, Vasco Carvalho, Mireia Crispin and a foreward with Jess Montgomery @dianecoyle1859.bsky.social @ginasue.bsky.social

New preprint from our work on using LLMs to accelerate conservation evidence synthesis across millions of papers. We crosscheck 3 retrieval strategies against 10 LLMs and benchmark against human experts and find quite a bit of variance https://www.researchsquare.com/article/rs-5409185/v1