Assete: In-Browser LLM Workflows
See a 4B browser model run locally via WebGPU in a Chrome extension, analyzing your browsing data to reconstruct cross-app workflows and turn tasks into skills.
A Chrome extension that captures browsing activity into a local browser database, then runs Gemma 4 (4B) entirely in-browser via WebGPU to analyze cross-app workflows. No data leaves the browser. Live demo: the model queries my real browsing data and identifies that I was filing invoices from Gmail to a Google Drive folder, reconstructing intent from copy/paste patterns, typed text, and app switching. Try it at assete.ai.
Assete is a privacy-focused browser extension tracking user engagement metrics.
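The description mentions reconstructing intent from copy/paste patterns and app switching. As a rough illustration of that idea only, the TypeScript sketch below pairs each paste with the most recent copy and counts cross-app transfers. The `ActivityEvent` shape, the `summarizeTransfers` name, and the hostnames are hypothetical: the talk summary does not describe Assete's actual schema, and in the demo the analysis is done by the model rather than by a hand-written heuristic like this one.

```typescript
// Hypothetical event shape; the real capture schema is not given in the summary.
interface ActivityEvent {
  kind: "copy" | "paste";
  app: string; // e.g. a hostname
  at: number;  // timestamp in milliseconds
}

interface Transfer {
  from: string;
  to: string;
  count: number;
}

// Pair each paste with the most recent copy and count transfers
// where the copy and the paste happened in different apps.
function summarizeTransfers(events: ActivityEvent[]): Transfer[] {
  const sorted = events.slice().sort((a, b) => a.at - b.at);
  const counts: Record<string, Transfer> = {};
  let lastCopyApp: string | null = null;

  for (const e of sorted) {
    if (e.kind === "copy") {
      lastCopyApp = e.app;
    } else if (lastCopyApp !== null && lastCopyApp !== e.app) {
      const key = lastCopyApp + " -> " + e.app;
      const t = counts[key] ?? { from: lastCopyApp, to: e.app, count: 0 };
      t.count += 1;
      counts[key] = t;
    }
  }

  // Most frequent transfer first.
  return Object.keys(counts)
    .map((k) => counts[k])
    .sort((a, b) => b.count - a.count);
}
```

A repeated copy in one app followed by a paste in another shows up as a single `Transfer` with a high `count`, which is the kind of signal a model could then interpret as "filing items from A into B".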
- Gemma 4 E4B: Gemma 4 E4B is a high-efficiency 4-billion parameter model optimized for edge deployment and low-latency inference. Google's Gemma 4 E4B leverages a refined 4B parameter architecture to bridge the gap between mobile hardware constraints and enterprise-grade reasoning. Built on the same technical foundations as Gemini, this model utilizes advanced quantization techniques to maintain 92% of full-precision performance while operating within a 2.5GB memory footprint. It excels in local RAG (Retrieval-Augmented Generation) workflows and structured data extraction, providing developers a private, offline-capable alternative to cloud-based APIs without sacrificing execution speed.
- Qwen 3 8B: Qwen3 8B is an open-weight 8.2 billion parameter language model from Alibaba Cloud that natively supports seamless switching between deep reasoning and rapid dialogue. Alibaba Cloud released Qwen3 8B on April 28, 2025, introducing a highly efficient 8.2 billion parameter dense model designed to run locally on consumer hardware. This model stands out by integrating a dual-mode behavior control: a thinking mode for multi-step logical reasoning (such as complex coding and mathematics) and a non-thinking mode for fast, direct conversational responses. Built with a native context window of 32,768 tokens (extendable to 128,000 tokens via YaRN), it matches the performance of much larger models while consuming less than 5 gigabytes of memory under 4-bit quantization.
- WebGPU: WebGPU is the modern JavaScript API providing high-performance 3D graphics and general-purpose GPU compute (GPGPU) access for the web platform. WebGPU is the successor to WebGL: it exposes a low-level, explicit API for leveraging the system’s Graphics Processing Unit (GPU) for both rendering and computation. The API design mirrors modern native graphics frameworks (Vulkan, Metal, Direct3D 12), ensuring better performance and compatibility across diverse hardware. It introduces first-class support for compute shaders, enabling high-speed parallel processing for tasks like machine learning and physics simulations directly in the browser. All shader code is written in the custom WebGPU Shading Language (WGSL), which is validated and compiled for security and efficiency.
- Chrome MV3: Manifest V3 is the latest iteration of the Chrome extension platform, prioritizing security, privacy, and performance through service workers and declarative APIs. Chrome MV3 shifts the extension architecture from persistent background pages to ephemeral service workers, significantly reducing browser memory overhead. It replaces the blocking form of the webRequest API with declarativeNetRequest, which guarantees each extension at least 30,000 static rules for content filtering without the extension reading request contents. This update enforces a strict Content Security Policy (CSP) and bans remotely hosted code: all logic must be bundled within the package to ensure transparent review processes. These changes streamline extension execution while providing users with granular control over site permissions and data privacy.
- SQL: SQL (Structured Query Language) is the standard language for managing and manipulating data in relational database management systems (RDBMS). SQL, or Structured Query Language, is the essential tool for handling structured data: it’s the standard for RDBMS. Use it to execute core data operations: `SELECT` to retrieve specific records, `INSERT` to add new rows, `UPDATE` to modify existing data, and `DELETE` to remove it. This language is critical across all major platforms (MySQL, PostgreSQL, Microsoft SQL Server, Oracle), ensuring data integrity and enabling complex joins across multiple tables. Every data professional needs fluency in SQL; it’s the bedrock of modern data management.
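The memory figures in the Gemma and Qwen entries above can be sanity-checked with simple arithmetic: weight storage is roughly the parameter count times bits per weight, divided by 8. The TypeScript sketch below is an estimate under stated assumptions, not a measurement. `weightMemoryGB` is a hypothetical helper, the parameter counts are the rounded figures quoted in those entries, 4-bit weights for the 4B model are an assumption (that entry does not state a bit width), and the result ignores KV cache, activations, and quantization metadata.

```typescript
// Hypothetical helper: approximate weight storage in decimal gigabytes.
// Ignores KV cache, activations, and per-block quantization scales.
function weightMemoryGB(paramCount: number, bitsPerWeight: number): number {
  const bytes = (paramCount * bitsPerWeight) / 8;
  return bytes / 1e9;
}

// Parameter counts as quoted in the entries above (rounded).
const qwenWeightsGB = weightMemoryGB(8.2e9, 4); // 4-bit, as stated in the Qwen entry
const gemmaWeightsGB = weightMemoryGB(4e9, 4);  // 4-bit is an assumption here

console.log(qwenWeightsGB.toFixed(1), gemmaWeightsGB.toFixed(1));
```

Under these assumptions the weights alone come to about 4.1 GB and 2.0 GB, which is consistent with the "less than 5 gigabytes" and "2.5GB" figures once some runtime overhead is added.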