Grant #240273 to Web AI Fund
Web LLM Accelerator Framework
Rejected
Grant #240273
ai
llm
Submitted by Derrick Cerf
Feb 26, 2025
Request Details
Project Summary
WebLLM Accelerator is an open-source toolkit that enables efficient deployment and operation of smaller language models directly in the browser using WebGPU, eliminating cloud dependencies while preserving user privacy.
Why This Matters
Current browser-based AI implementations face significant performance bottlenecks and memory constraints. My framework will address these challenges by:
- Optimizing model quantization specifically for WebGPU constraints
- Implementing progressive loading for larger models
- Providing cross-framework bindings for React, Vue, and Svelte
Technical Implementation Plan (what I have done so far)
1. Develop a WebGPU-optimized inference engine for 1-3B parameter models
2. Create adaptive quantization techniques that respond to device capabilities
3. Build a prompt engineering toolkit that maximizes performance from smaller models
4. Provide simple APIs that abstract WebGPU complexity from developers
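To make the quantization work above concrete, here is a minimal sketch of symmetric per-tensor INT8 quantization, the baseline the adaptive techniques would build on (illustrative only; a production toolkit would quantize per-channel and handle outliers):

```javascript
// Sketch: symmetric per-tensor INT8 quantization with a single scale factor.
// Function names are illustrative, not part of an existing API.

// Quantize a Float32Array of weights to Int8Array plus a scale.
function quantizeInt8(weights) {
  let maxAbs = 0;
  for (const w of weights) maxAbs = Math.max(maxAbs, Math.abs(w));
  const scale = maxAbs / 127 || 1; // guard against all-zero tensors
  const q = new Int8Array(weights.length);
  for (let i = 0; i < weights.length; i++) {
    q[i] = Math.max(-127, Math.min(127, Math.round(weights[i] / scale)));
  }
  return { q, scale };
}

// Reconstruct approximate float weights from the quantized form.
function dequantizeInt8({ q, scale }) {
  const out = new Float32Array(q.length);
  for (let i = 0; i < q.length; i++) out[i] = q[i] * scale;
  return out;
}
```

The same shape extends to INT4 by packing two values per byte; the adaptive layer would pick the precision based on the device's reported WebGPU limits.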
Initial Milestones (3-month timeline)
- Month 1: WebGPU kernel optimization and model compression toolkit
- Month 2: Progressive loading system and framework integrations
- Month 3: Documentation, demos, and educational resources
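As an illustration of the Month 2 progressive-loading milestone, a shard loader might look like the following sketch (the shard layout and fetcher signature are assumptions, not an existing API):

```javascript
// Sketch: load model weight shards sequentially, reporting progress,
// then concatenate them into one buffer ready for upload to a GPU buffer.
// `fetchers` is an array of functions returning a Promise<ArrayBuffer>,
// e.g. () => fetch(url).then(r => r.arrayBuffer()).
async function loadShards(fetchers, onProgress) {
  const shards = [];
  let loaded = 0;
  for (const fetchShard of fetchers) {
    const bytes = await fetchShard();
    shards.push(new Uint8Array(bytes));
    loaded++;
    if (onProgress) onProgress(loaded / fetchers.length);
  }
  // Concatenate all shards into a single contiguous Uint8Array.
  const total = shards.reduce((n, s) => n + s.length, 0);
  const out = new Uint8Array(total);
  let offset = 0;
  for (const s of shards) { out.set(s, offset); offset += s.length; }
  return out;
}
```

The progress callback is what lets a UI show load state while a multi-hundred-megabyte model streams in, instead of blocking on one monolithic download.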
Why This Will Succeed
The project directly addresses two key fund priorities: enabling LLMs in-browser via WebGPU and supporting framework ecosystem integration. By focusing on making smaller models more powerful rather than just running large models inefficiently, I create practical solutions developers can use today.
Implementation Guidance
To work on this project:
1. **Build my expertise**:
- Learn WebGPU fundamentals (see the [WebGPU samples repository](https://github.com/webgpu/webgpu-samples))

- Understand model quantization techniques (INT8, INT4)
- Familiarize myself with smaller LLMs (Phi, TinyLlama, etc.)
2. **Start small**:
- Begin by implementing a simple matrix multiplication operation in WebGPU
- Build a proof-of-concept with a tiny model (~100M parameters)
- Gradually scale up complexity
3. **Leverage existing tools**:
- Fork and modify ONNX Runtime Web (formerly ONNX.js) or TensorFlow.js as starting points
- Study WebAssembly-based ML projects for optimization techniques
- Connect with the WebGPU community for technical guidance
4. **Focus on demonstrable results**:
- Create compelling demos showing real-world applications
- Benchmark against server-based alternatives
- Document performance improvements clearly
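The "start small" step above pairs naturally with a CPU reference implementation: a WebGPU matrix-multiplication kernel's output buffer can be checked element-by-element against a plain JavaScript result before scaling up (a sketch, assuming row-major layout; the function name is illustrative):

```javascript
// CPU reference for C = A x B, used to validate a WebGPU compute kernel.
// a is m x k, b is k x n, both flat Float32Arrays in row-major order.
function matmulCPU(a, b, m, k, n) {
  const c = new Float32Array(m * n);
  for (let i = 0; i < m; i++) {
    for (let j = 0; j < n; j++) {
      let sum = 0;
      for (let p = 0; p < k; p++) sum += a[i * k + p] * b[p * n + j];
      c[i * n + j] = sum;
    }
  }
  return c;
}
```

Comparing the GPU result against this reference (within a small floating-point tolerance) is also the basis for the benchmarking and documentation work in the final step.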
This project provides practical value while pushing technical boundaries in browser-based AI.
$5,000.00 USD
Total amount $5,000.00 USD
Additional Information
Payout method
Bank account
Details
********
Expense created by Derrick Cerf
Expense rejected by Addy Osmani
Expense policies
We process expenses twice a week after an admin of the Collective has approved them. We make payments via PayPal and Bank Transfer (using Wise) and can only make payouts to countries served by these payment processors. You are not required to upload an invoice document (the data you submit in the expense form is sufficient), but if you would like to include an uploaded invoice, please make it out to:
Collective Name, Open Source Collective, 440 N. Barranca Avenue #3939, Covina, CA 91723, USA
INFORMATION REQUIRED ON EXPENSES:
Please refer to the documentation here for payment processing requirements on reimbursements and invoices.
REFUNDS:
If you would like a refund, please email [email protected] with the transaction #, the collective you donated to, the date, and the amount of the transaction.
FAQ
How do I get paid from a Collective?
Submit an expense and provide your payment information.
How are expenses approved?
Collective admins are notified when an expense is submitted, and they can approve or reject it.
Is my private data made public?
No. Only the expense amount and description are public. Attachments, payment info, emails and addresses are only visible to you and the admins.
When will I get paid?
Payments are processed by the Collective's Fiscal Host, the organization that holds funds on their behalf. Many Fiscal Hosts pay expenses weekly, but each one is different.
Why do you need my legal name?
The display name is public and the legal name is private, appearing on receipts, invoices, and other official documentation used for tax and accounting purposes.
Project balance
$180,000.00 USD
Fiscal Host: Open Source Collective