Articles

  • 6 days ago | thenewstack.io | Janakiram MSV

    Tutorial: GPU-Accelerated Serverless Inference With Google Cloud Run. Recently, Google Cloud launched GPU support for the Cloud Run serverless platform. This feature enables developers to accelerate serverless inference of models deployed on Cloud Run.

  • 1 week ago | forbes.com | Janakiram MSV

    OpenAI launched its GPT-4.1 family of AI models focusing on enhancing developer productivity through improved coding, long-context handling and instruction-following capabilities available directly via its application programming interface. The release includes three distinct models, GPT-4.1, GPT-4.1 mini and GPT-4.1 nano, signaling a move toward task-specific optimizations within the large language model landscape.

  • 1 week ago | forbes.com | Janakiram MSV

    At Cloud Next, Google Cloud’s annual user conference, the company presented its vision for AI agents. Google has launched tools and services enabling developers and business users to build agents. Gemini as the Foundation of Google’s AI Strategy: Gemini is the cornerstone of Google’s AI agent strategy, leveraging its advanced multimodal capabilities to process and generate responses across text, images, audio, video and code.

  • 2 weeks ago | thenewstack.io | Janakiram MSV

    Serving large language models (LLMs) at scale presents many challenges beyond those faced by traditional web services or smaller ML models. Cost is a primary concern for LLM inference, which requires powerful GPUs or specialized hardware, enormous memory and significant energy. Without careful optimization, operational expenses can skyrocket for high-volume LLM services.

  • 3 weeks ago | thenewstack.io | Janakiram MSV

    Microservices changed how we build software by breaking systems into composable, independently deployable units. But as systems scale, so does the cognitive and operational load on developers — tracking dependencies, debugging across services, and managing deployments. We’re hitting diminishing returns. Enter agentic workflows: systems where autonomous agents interpret goals, plan actions, and execute tasks using the available tools.

X (formerly Twitter)

Followers: 10K
Tweets: 11K
DMs Open: Yes
Janakiram MSV @janakiramm
8 Apr 25

RT @rseroter: Most folks should be asking themselves "how does what I do feed into AI apps or agents?" It'll be relevant in some way. For…

Janakiram MSV @janakiramm
3 Apr 25

RT @thenewstack: Tutorial: Set Up a Cloud Native GPU Testbed With Nvkind Kubernetes | By @janakiramm https://t.co/FSBCXse1cz

Janakiram MSV @janakiramm
8 Feb 25

Back to basics. Renewed my @googlecloud Professional Cloud Architect certification for the 4th time. https://t.co/qpVBoUJAPT