Blog

Your blog category

New Years Resolution Template: A Guide to Success

Welcome to the ultimate guide for achieving your New Year’s goals! As we embrace the fresh start that a new year brings, it’s time to transform your aspirations into reality with our expert-crafted New Year’s Resolution Template. Tailored to help you set, track, and accomplish your goals, this template is your roadmap to success. Whether […]

New Years Resolution Template: A Guide to Success Read More »

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. Using AWS Trainium and Inferentia based instances, through SageMaker, can help users lower fine-tuning costs by up to 50%, and lower deployment costs by 4.7x, while lowering per token latency.

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium Read More »

Minecraft x Planet Earth III is the least offensive corpo collab of the year

If you own a copy of Minecraft: Bedrock Edition or Minecraft: Education Edition, you can now grab a free expansion pack based on the BBC’s Planet Earth III. Much like the previous Frozen Planet II experience, this new wildlife documentary DLC lets players explore five scenarios through the lens of animals — arctic wolves, ocelots,

Minecraft x Planet Earth III is the least offensive corpo collab of the year Read More »

Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention

This post is co-written with Jayadeep Pabbisetty, Sr. Specialist Data Engineering at Merck, and Prabakaran Mathaiyan, Sr. ML Engineer at Tiger Analytics. The large machine learning (ML) model development lifecycle requires a scalable model release process similar to that of software development. Model developers often work together in developing ML models and require a robust

Build an Amazon SageMaker Model Registry approval and promotion workflow with human intervention Read More »

Build financial search applications using the Amazon Bedrock Cohere multilingual embedding model

Enterprises have access to massive amounts of data, much of which is difficult to discover because the data is unstructured. Conventional approaches to analyzing unstructured data use keyword or synonym matching. They don’t capture the full context of a document, making them less effective in dealing with unstructured data. In contrast, text embeddings use machine

Build financial search applications using the Amazon Bedrock Cohere multilingual embedding model Read More »

The ASUS AirVision M1 glasses give you big virtual screens in a travel-friendly package

At CES 2024, ASUS seems to have taken people by surprise with the announcement of its AirVision M1 glasses, with some viewing it as an alternative to Apple’s Vision Pro headset. But I discovered that ASUS’ glasses are much more of a novel alternative to portable monitors than something meant for spatial computing. The big

The ASUS AirVision M1 glasses give you big virtual screens in a travel-friendly package Read More »

Ball position tracking in the cloud with the PGA TOUR

The PGA TOUR continues to enhance the golf experience with real-time data that brings fans closer to the game. To deliver even richer experiences, they are pursuing the development of a next-generation ball position tracking system that automatically tracks the position of the ball on the green. The TOUR currently uses ShotLink powered by CDW,

Ball position tracking in the cloud with the PGA TOUR Read More »

A wild Rabbit gadget appears while Google offers its own take on Apple software tricks

The show floor at CES 2024 is open, and people have been racking up their steps, canvassing Las Vegas’ vast convention centers and hotel ballrooms to see all the latest and weirdest tech products. The Engadget team has been getting our cardio in, braving both vehicular and human traffic to get face and hand time

A wild Rabbit gadget appears while Google offers its own take on Apple software tricks Read More »

Inference Llama 2 models with real-time response streaming using Amazon SageMaker

With the rapid adoption of generative AI applications, there is a need for these applications to respond in time to reduce the perceived latency with higher throughput. Foundation models (FMs) are often pre-trained on vast corpora of data with parameters ranging in scale of millions to billions and beyond. Large language models (LLMs) are a

Inference Llama 2 models with real-time response streaming using Amazon SageMaker Read More »

Scroll to Top