
Janakiram MSV
Senior Contributor at Forbes
Principal and Analyst at Janakiram MSV
Analyst | Advisor | Architect
Articles
-
6 days ago |
thenewstack.io | Janakiram MSV
Tutorial: GPU-Accelerated Serverless Inference With Google Cloud Run Feature image via Unsplash. Recently, Google Cloud launched GPU support for the Cloud Run serverless platform. This feature enables developers to accelerate serverless inference of models deployed on Cloud Run.
-
1 week ago |
forbes.com | Janakiram MSV
OpenAI launched its GPT-4.1 family of AI models focusing on enhancing developer productivity through improved coding, long-context handling and instruction-following capabilities available directly via its application programming interface. The release includes three distinct models, GPT-4.1, GPT-4.1 mini and GPT-4.1 nano, signaling a move toward task-specific optimizations within the large language model landscape.
-
1 week ago |
forbes.com | Janakiram MSV
At Cloud Next, Google Cloud’s annual user conference, the company presented its vision for AI agents. Google has launched tools and services enabling developers and business users to build agents. Gemini as the Foundation of Google’s AI StrategyGemini is the cornerstone of Google’s AI agent strategy, leveraging its advanced multimodal capabilities to process and generate responses across text, images, audio, video and code.
-
2 weeks ago |
thenewstack.io | Janakiram MSV
Serving large language models (LLMs) at scale presents many challenges beyond those faced by traditional web services or smaller ML models. Cost is a primary concern for LLM inference, which requires powerful GPUs or specialized hardware, enormous memory and significant energy. Without careful optimization, operational expenses can skyrocket for high-volume LLM services.
-
3 weeks ago |
thenewstack.io | Janakiram MSV
Microservices changed how we build software by breaking systems into composable, independently deployable units. But as systems scale, so does the cognitive and operational load on developers — tracking dependencies, debugging across services, and managing deployments. We’re hitting diminishing returns. Enter agentic workflows: systems where autonomous agents interpret goals, plan actions, and execute tasks using the available tools.
Try JournoFinder For Free
Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.
Start Your 7-Day Free Trial →X (formerly Twitter)
- Followers
- 10K
- Tweets
- 11K
- DMs Open
- Yes

RT @rseroter: Most folks should be asking themselves "how does what I do feed into AI apps or agents?" It'll be relevant in some way. For…

RT @thenewstack: Tutorial: Set Up a Cloud Native GPU Testbed With Nvkind Kubernetes | By @janakiramm https://t.co/FSBCXse1cz

Back to basics. Renewed my @googlecloud Professional Cloud Architect certification for the 4th time. https://t.co/qpVBoUJAPT