UptownBudget
The Future Of AI Is At The Edge: Cloudflare Leads The Way

By admin | November 25, 2023 | 3 Mins Read

Cloudflare, the leading content delivery network and cloud security platform, wants to make AI accessible to developers. It has added GPU-powered infrastructure and model-serving capabilities to its edge network, bringing state-of-the-art foundation models to the masses. Any developer can tap into Cloudflare’s AI platform with a simple REST API call.

Cloudflare introduced Workers, a serverless compute platform at the edge, in 2017. Developers can use this serverless platform to create JavaScript Service Workers that run directly in Cloudflare’s edge locations around the world. With a Worker, a developer can modify a site’s HTTP requests and responses, make parallel requests, and even respond directly from the edge. Cloudflare Workers use an API that is similar to the W3C Service Workers standard.

The rise of generative AI prompted Cloudflare to augment its Workers with AI capabilities. The platform has three new elements to support AI inference:

  • Workers AI operates on NVIDIA GPUs within Cloudflare’s global network, enabling the serverless model for AI. Users only pay for what they use, allowing them to spend less time on infrastructure management and more time on their applications.
  • Vectorize, a vector database, enables easy, rapid, and cost-effective vector indexing and storage, supporting use cases that require access not only to operational models but also to customized data.
  • AI Gateway enables organizations to cache, rate limit, and monitor their AI deployments regardless of the hosting environment.
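
To make the caching and rate-limiting idea concrete, here is a minimal Python sketch of a proxy that sits in front of a model. It is not Cloudflare's AI Gateway and does not use its API; `TinyGateway`, `backend`, `max_calls`, and `window_s` are hypothetical names chosen for illustration.

```python
import time
from typing import Callable, Dict, List


class TinyGateway:
    """Toy caching, rate-limiting proxy in front of a model (illustrative only)."""

    def __init__(
        self,
        backend: Callable[[str], str],
        max_calls: int,
        window_s: float,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.backend = backend          # function that actually runs the model
        self.max_calls = max_calls      # backend calls allowed per window
        self.window_s = window_s        # sliding window length in seconds
        self.clock = clock              # injectable clock, handy for tests
        self.cache: Dict[str, str] = {}
        self.calls: List[float] = []    # timestamps of recent backend calls
        self.stats = {"hits": 0, "misses": 0, "rejected": 0}

    def ask(self, prompt: str) -> str:
        # Repeated prompts are served from the cache at no backend cost.
        if prompt in self.cache:
            self.stats["hits"] += 1
            return self.cache[prompt]

        # Keep only the calls that still fall inside the sliding window.
        now = self.clock()
        self.calls = [t for t in self.calls if now - t < self.window_s]
        if len(self.calls) >= self.max_calls:
            self.stats["rejected"] += 1
            raise RuntimeError("rate limit exceeded")

        self.calls.append(now)
        self.stats["misses"] += 1
        answer = self.backend(prompt)
        self.cache[prompt] = answer
        return answer
```

The `stats` dictionary stands in for the monitoring side: counting hits, misses, and rejections is the simplest form of the observability a gateway of this kind provides.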

Cloudflare has partnered with NVIDIA, Microsoft, Hugging Face, Databricks, and Meta to bring GPU infrastructure and foundation models to its edge. The platform also hosts embedding models that convert text to vectors. The Vectorize database stores, indexes, and queries those vectors so that applications can supply relevant context to LLMs and reduce hallucinations in their responses. The AI Gateway provides observability, rate limiting, and caching of frequent queries, which lowers cost while improving application performance.
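
The retrieval step described above can be sketched in plain Python. This is a conceptual illustration, not the Vectorize API: the in-memory `index` dictionary, the `top_k` and `build_prompt` helpers, and the prompt wording are all assumptions, and a real deployment would get its vectors from an embedding model rather than hand-written lists.

```python
import math
from typing import Dict, List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k(query_vec: Sequence[float], index: Dict[str, Sequence[float]], k: int = 2) -> List[str]:
    """Return the k stored documents whose vectors are closest to the query."""
    ranked = sorted(index, key=lambda doc: cosine(query_vec, index[doc]), reverse=True)
    return ranked[:k]


def build_prompt(question: str, context_docs: List[str]) -> str:
    """Prepend retrieved documents so the model answers from supplied context."""
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

A vector database does the same nearest-neighbor lookup at scale with a purpose-built index instead of the linear scan shown here.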

The model catalog for Workers AI boasts the most recent and some of the best foundation models. From Meta’s Llama 2 to Stable Diffusion XL to Mistral 7B, it has everything developers need to build modern applications powered by generative AI.

Behind the scenes, Cloudflare uses ONNX Runtime, an open source runtime for Open Neural Network Exchange (ONNX) models led by Microsoft, to optimize running models in resource-constrained environments. It’s the same technology that Microsoft relies on to run foundation models in Windows.

Developers can write AI inference code in JavaScript and deploy it to Cloudflare’s edge network, but they can also invoke the models from any language through a simple REST API. This makes it easy to infuse generative AI into web, desktop and mobile applications that run in diverse environments.
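
As an illustration of the REST route, the sketch below builds (but does not send) an inference request using only the Python standard library. The endpoint path, the `prompt` body field, and any model identifier such as `@cf/meta/llama-2-7b-chat-int8` are assumptions based on Cloudflare's public documentation as of late 2023 and should be checked against the current API reference; `build_inference_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Assumed base URL of the Cloudflare v4 API; verify against current docs.
API_BASE = "https://api.cloudflare.com/client/v4"


def build_inference_request(
    account_id: str, api_token: str, model: str, prompt: str
) -> urllib.request.Request:
    """Build a POST request that asks a hosted model to complete a prompt."""
    # Assumed URL shape: /accounts/<account>/ai/run/<model identifier>
    url = f"{API_BASE}/accounts/{account_id}/ai/run/{model}"
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; that step needs a real account ID and API token, so it is left out here.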

Workers AI launched in September 2023 with inference capabilities in seven cities. Cloudflare’s stated goal is to support Workers AI inference in 100 cities by the end of 2023, with near-ubiquitous coverage by the end of 2024.

Cloudflare is one of the first CDN and edge network providers to add AI capabilities to its edge network, through GPU-powered Workers AI, a vector database, and an AI Gateway for managing AI deployments. By partnering with tech giants like Meta and Microsoft, it offers a wide model catalog along with ONNX Runtime optimization.
