Self-hosting an LLM such as the 70-billion-parameter LLaMA 3.1 may seem daunting, but with the right hardware and optimizations (appropriate GPUs, quantization techniques, and sharding), it can be done without spending a fortune. Hybrid cloud setups offer a good trade-off between cost and flexibility while keeping control over your data.
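To see why quantization and sharding matter, here is a quick back-of-the-envelope sketch of the VRAM needed just for the weights of a 70B-parameter model at different precisions, and how many GPUs a naive shard would require. The 24 GB GPU size is an illustrative assumption (e.g. a consumer-class card), and the figures ignore KV cache and activation overhead:

```python
import math

# Weight-memory estimate for a 70B-parameter model (weights only;
# KV cache and activations add further overhead in practice).
PARAMS = 70e9  # LLaMA 3.1 70B

BYTES_PER_PARAM = {
    "fp16": 2.0,   # full half-precision
    "int8": 1.0,   # 8-bit quantization
    "int4": 0.5,   # 4-bit quantization
}

GPU_VRAM_GB = 24  # assumed per-GPU memory, purely illustrative

for precision, bytes_per in BYTES_PER_PARAM.items():
    weights_gb = PARAMS * bytes_per / 1024**3
    gpus = math.ceil(weights_gb / GPU_VRAM_GB)
    print(f"{precision}: ~{weights_gb:.0f} GB of weights "
          f"-> at least {gpus} x {GPU_VRAM_GB} GB GPUs")
```

Running this shows the practical effect: fp16 needs roughly 130 GB (six 24 GB cards), while 4-bit quantization fits the weights in about 33 GB, within reach of a two-GPU workstation.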
Replit has launched Replit Agent, an AI assistant that simplifies software development. The tool converts natural-language instructions into working code, allowing anyone to create applications, even without technical experience. Below are a couple of interesting prompts for Claude and Reflection 70B.
MFLUX brings FLUX to the Apple ecosystem through a careful port to Apple MLX. Key features include clean code, a minimalist approach without unnecessary configurations, and reduced dependencies. It supports FLUX.1-Schnell and FLUX.1-Dev models and is a breeze to install using pip. Image generation can be customized via command-line options, and it supports quantization to boost performance on Mac devices.
Bland, the AI phone agent that "talks" like a human and can handle calls in multiple languages, secures $22 million in funding. Meanwhile, Magic, in partnership with Google Cloud, launches an impressive AI model that handles contexts of up to 100 million tokens, promising to revolutionize code synthesis and beyond.