CuriousBeaver

Writing about AI research, hobbies, and life.

Recent Posts

🔬 Research November 20, 2024

Understanding Policy Gradients in Reinforcement Learning

A deep dive into policy gradient methods, exploring the math behind REINFORCE and how it enables agents to learn directly from reward signals.

#reinforcement-learning #machine-learning #deep-learning

🌍 Travel November 5, 2024

Autumn in Kyoto: Temples, Gardens, and Quiet Moments

A week wandering through Kyoto in autumn, when the maple leaves turn red and the ancient city is at its most beautiful.

#japan #travel #photography