https://jerryzyc.github.io/learning/2024/11/01/learning-ppo-tricks.html 2024-11-01T00:00:00+08:00 https://jerryzyc.github.io/learning/2024/11/02/learning-domain-randomization.html 2024-11-02T00:00:00+08:00 https://jerryzyc.github.io/skills/2024/11/05/skills-robot-learning-roadmap.html 2024-11-05T00:00:00+08:00 https://jerryzyc.github.io/work/2024/11/06/work-grasping-retrospective.html 2024-11-06T00:00:00+08:00 https://jerryzyc.github.io/paper/2026/01/12/paper-sru-nav-rl.html 2026-01-12T00:00:00+08:00 https://jerryzyc.github.io/paper/2026/01/13/paper-diffusion-policy.html 2026-01-13T00:00:00+08:00 https://jerryzyc.github.io/paper/2026/01/13/paper-flying-point-clouds-rl.html 2026-01-13T00:00:00+08:00 https://jerryzyc.github.io/paper/2026/01/13/paper-wmp-locomotion.html 2026-01-13T00:00:00+08:00 https://jerryzyc.github.io/skills/math-formula-rendering/SKILL.html https://jerryzyc.github.io/skills/paper-reading-blog-generator/SKILL.html https://jerryzyc.github.io/skills/permission-operations/SKILL.html https://jerryzyc.github.io/about/ https://jerryzyc.github.io/categories/ https://jerryzyc.github.io/ https://jerryzyc.github.io/learning-notes/ https://jerryzyc.github.io/notes/templates/paper-note-template.html https://jerryzyc.github.io/paper-notes/ https://jerryzyc.github.io/search/ https://jerryzyc.github.io/learning/ https://jerryzyc.github.io/tags/ https://jerryzyc.github.io/notes/templates/work-project-template.html https://jerryzyc.github.io/work-projects/ https://jerryzyc.github.io/docs/writing-guide.html https://jerryzyc.github.io/learning/cross-cutting-concepts/generalization/ https://jerryzyc.github.io/learning/cross-cutting-concepts/information-theory/ https://jerryzyc.github.io/learning/cross-cutting-concepts/optimization-theory/ https://jerryzyc.github.io/learning/cross-cutting-concepts/probability-review/ https://jerryzyc.github.io/learning/cross-cutting-concepts/research-best-practices/ https://jerryzyc.github.io/learning/deep-learning/architectures/attention-mechanism/ https://jerryzyc.github.io/learning/deep-learning/architectures/cnn/ https://jerryzyc.github.io/learning/deep-learning/architectures/lstm-gru/ https://jerryzyc.github.io/learning/deep-learning/architectures/mlp/ https://jerryzyc.github.io/learning/deep-learning/architectures/rnn/ https://jerryzyc.github.io/learning/deep-learning/architectures/transformer/ https://jerryzyc.github.io/learning/deep-learning/dl-in-practice/debugging/ https://jerryzyc.github.io/learning/deep-learning/dl-in-practice/deployment-considerations/ https://jerryzyc.github.io/learning/deep-learning/dl-in-practice/overfitting/ https://jerryzyc.github.io/learning/deep-learning/dl-in-practice/scaling-laws/ https://jerryzyc.github.io/learning/deep-learning/foundations/activation-functions/ https://jerryzyc.github.io/learning/deep-learning/foundations/backpropagation/ https://jerryzyc.github.io/learning/deep-learning/foundations/loss-functions/ https://jerryzyc.github.io/learning/deep-learning/foundations/perceptron/ https://jerryzyc.github.io/learning/deep-learning/representation-learning/autoencoders/ https://jerryzyc.github.io/learning/deep-learning/representation-learning/contrastive-learning/ https://jerryzyc.github.io/learning/deep-learning/representation-learning/self-supervised/ https://jerryzyc.github.io/learning/deep-learning/representation-learning/vae/ https://jerryzyc.github.io/learning/deep-learning/training/generalization/ https://jerryzyc.github.io/learning/deep-learning/training/initialization/ https://jerryzyc.github.io/learning/deep-learning/training/normalization/ https://jerryzyc.github.io/learning/deep-learning/training/optimization-tricks/ https://jerryzyc.github.io/learning/deep-learning/training/regularization/ https://jerryzyc.github.io/learning/machine-learning/classical-models/gaussian-mixture/ https://jerryzyc.github.io/learning/machine-learning/classical-models/kmeans/ https://jerryzyc.github.io/learning/machine-learning/classical-models/linear-regression/ https://jerryzyc.github.io/learning/machine-learning/classical-models/logistic-regression/ https://jerryzyc.github.io/learning/machine-learning/classical-models/svm/ https://jerryzyc.github.io/learning/machine-learning/fundamentals/bias-variance/ https://jerryzyc.github.io/learning/machine-learning/fundamentals/evaluation-metrics/ https://jerryzyc.github.io/learning/machine-learning/fundamentals/problem-types/ https://jerryzyc.github.io/learning/machine-learning/fundamentals/train-test-split/ https://jerryzyc.github.io/learning/machine-learning/fundamentals/what-is-ml/ https://jerryzyc.github.io/learning/machine-learning/ml-in-practice/data-leakage/ https://jerryzyc.github.io/learning/machine-learning/ml-in-practice/failure-cases/ https://jerryzyc.github.io/learning/machine-learning/ml-in-practice/feature-engineering/ https://jerryzyc.github.io/learning/machine-learning/ml-in-practice/hyperparameter-tuning/ https://jerryzyc.github.io/learning/machine-learning/optimization/convexity/ https://jerryzyc.github.io/learning/machine-learning/optimization/gradient-descent/ https://jerryzyc.github.io/learning/machine-learning/optimization/numerical-stability/ https://jerryzyc.github.io/learning/machine-learning/optimization/regularization/ https://jerryzyc.github.io/learning/machine-learning/probabilistic-ml/bayesian-inference/ https://jerryzyc.github.io/learning/machine-learning/probabilistic-ml/em-algorithm/ https://jerryzyc.github.io/learning/machine-learning/probabilistic-ml/graphical-models/ https://jerryzyc.github.io/learning/machine-learning/probabilistic-ml/mle-map/ https://jerryzyc.github.io/learning/reinforcement-learning/advanced-topics/credit-assignment/ https://jerryzyc.github.io/learning/reinforcement-learning/advanced-topics/memory-recurrence/ https://jerryzyc.github.io/learning/reinforcement-learning/advanced-topics/multi-agent-rl/ https://jerryzyc.github.io/learning/reinforcement-learning/advanced-topics/off-policy/ https://jerryzyc.github.io/learning/reinforcement-learning/advanced-topics/stability-issues/ https://jerryzyc.github.io/learning/reinforcement-learning/deep-rl/actor-critic/ https://jerryzyc.github.io/learning/reinforcement-learning/deep-rl/dqn/ https://jerryzyc.github.io/learning/reinforcement-learning/deep-rl/policy-gradient/ https://jerryzyc.github.io/learning/reinforcement-learning/deep-rl/ppo/ https://jerryzyc.github.io/learning/reinforcement-learning/deep-rl/sac/ https://jerryzyc.github.io/learning/reinforcement-learning/dynamic-programming/policy-evaluation/ https://jerryzyc.github.io/learning/reinforcement-learning/dynamic-programming/policy-iteration/ https://jerryzyc.github.io/learning/reinforcement-learning/dynamic-programming/value-iteration/ https://jerryzyc.github.io/learning/reinforcement-learning/fundamentals/exploration-exploitation/ https://jerryzyc.github.io/learning/reinforcement-learning/fundamentals/mdp/ https://jerryzyc.github.io/learning/reinforcement-learning/fundamentals/policy-value/ https://jerryzyc.github.io/learning/reinforcement-learning/fundamentals/pomdp/ https://jerryzyc.github.io/learning/reinforcement-learning/fundamentals/return-discount/ https://jerryzyc.github.io/learning/reinforcement-learning/model-free-rl/monte-carlo/ https://jerryzyc.github.io/learning/reinforcement-learning/model-free-rl/q-learning/ https://jerryzyc.github.io/learning/reinforcement-learning/model-free-rl/sarsa/ https://jerryzyc.github.io/learning/reinforcement-learning/model-free-rl/td-learning/ https://jerryzyc.github.io/learning/reinforcement-learning/rl-in-practice/curriculum-learning/ https://jerryzyc.github.io/learning/reinforcement-learning/rl-in-practice/failure-analysis/ https://jerryzyc.github.io/learning/reinforcement-learning/rl-in-practice/reward-design/ https://jerryzyc.github.io/learning/reinforcement-learning/rl-in-practice/sim2real/ https://jerryzyc.github.io/notes/raw.html https://jerryzyc.github.io/skills/article-quality-check/SKILL.html https://jerryzyc.github.io/skills/blog-health-check/SKILL.html https://jerryzyc.github.io/skills/cross-article-consistency/SKILL.html https://jerryzyc.github.io/skills/paper-to-blog-refactor/SKILL.html https://jerryzyc.github.io/skills/personal-learning-knowledge-graph/SKILL.html https://jerryzyc.github.io/skills/safe-dependency-update/SKILL.html https://jerryzyc.github.io/assets/papers/diffusion-policy.pdf 2026-01-14T20:30:01+08:00 https://jerryzyc.github.io/assets/papers/flying-point-clouds-rl.pdf 2026-01-14T20:30:01+08:00 https://jerryzyc.github.io/assets/papers/sru-nav-rl.pdf 2026-01-14T20:30:01+08:00 https://jerryzyc.github.io/assets/papers/wmp-locomotion.pdf 2026-01-14T20:30:01+08:00