Objective: Give the user a compact, implementation-ready one-pager user can hand to the team, then a pilot-test rubric with measurable thresholds.
One-Page Mini-Framework (Humanoids: Perception → Control → Adaptation)
Thesis. Reliable perception transforms sensor data into an actionable state; adaptability updates models/policies so whole-body behavior remains competent under change. The stack below is the shortest path from sensors to stable loco-manipulation with on-robot learning.
1) System Goals (anchor)
- Primary: Stable bipedal locomotion, robust grasping, and simple loco-manipulation in semi-structured indoor spaces.
- Constraints: Real-time, on-board compute; bounded latency; safe around people; recover from perturbations and domain shift.
2) Sensors (choose minimal viable set)
- Proprioceptive: Joint encoders, motor currents, 6-axis IMUs (torso + optional foot IMUs), force/torque at wrists/ankles.
- Exteroceptive: Stereo or RGB-D head camera (tilt-pan), optional LiDAR; hand/wrist camera for grasp; sparse tactile pads where feasible.
- Clocking: PPS or PTP-like sync; hardware timestamps everywhere.
3) State Estimation & Mapping (perception core)
- Pipeline: IMU leg odometry (1–2 kHz) → visual-inertial odometry (100–200 Hz) → factor-graph/fusion (30–60 Hz) → time-synced robot state.
- Scene Understanding: 2D/3D detection & segmentation for traversable space, obstacles, hands/objects; grasp pose estimator.
- Latency budget (95th-pct): sensing→state < 10–15 ms; sensing→object pose < 30 ms.
4) Whole-Body Control & Planning
- Controller: Whole-body controller (WBC) or MPC with contact constraints, friction cones, CoM/zero-moment margin.
- Planners: Footstep (terrain-aware), reach/grasp (IK + collision), and short-horizon loco-manipulation; reactive re-planning at 10–50 Hz.
- Shields: Safety supervisor (joint limits, thermal, torque/force caps), collision monitor, E-stop.
5) Adaptation (keep the robot competent under change)
- Online ID: Estimate mass/CoM shifts, contact/friction, actuator offsets; feed residuals to WBC/MPC.
- Policy Learning: Imitation + RL trained in sim with domain/dynamics randomization; periodic on-robot fine-tuning for edge cases.
- Fast Recovery: Meta-learning or residual policies to adapt in seconds; reset-free learning guarded by safety shields.
- Triggers: Drop in success/latency margins, new payloads, floor changes, lighting shifts, tool changeovers.
6) DataOps & MLOps (make improvement routine)
- Always-on logging: Raw sensors, state, actions, contact events, fails; keep synchronized bags.
- Offline improvement loop: Fit domain randomization to real logs; prune/quantize nets; regression tests on curated “pilot suite.”
- Versioning: Dataset + model + controller configs tracked; rollbacks are one command.
7) KPIs (acceptance gates)
- Pose drift (VIO): < 1% of path over a 50 m indoor loop.
- Perception-to-control latency: < 30 ms (95th-pct).
- Footstep placement median error: < 5 cm on flat; < 8 cm on uneven.
- Disturbance recovery: Lateral impulse ≈ 30 N·s → CoM recenters < 1.0 s, no fall.
- Grasp success (10 varied objects): ≥ 85%; re-grasp < 2 attempts.
- Adaptation: Success ≥ 80% after friction μ change 0.2↔0.7 or payload +5–10 kg without manual retune.
- Safety: Max unintended contact force < 150 N; E-stop reaction < 100 ms.
- Ops: Battery swap < 3 min; crash-free runtime ≥ 4 h.
Pilot Test Evaluation Rubric (quick-score 0–5)
How to use: Run each test block. Score 0–5, where 3 = pass (meets gate), 5 = stretch (exceeds), 1 = unacceptable. Weight by column “Weight.” Overall = Σ(score×weight)/Σ(weight).
Category | Metric (how measured) | Test Protocol | Pass (3) | Stretch (5) | Weight |
---|---|---|---|---|---|
Perception: State & Mapping | VIO drift (%) | 50 m indoor loop, loop-closure check | ≤1.0% | ≤0.3% | 10 |
Latency (ms) | Sensor→state→WBC 95th-pct | ≤30 | ≤18 | 6 | |
Object/hand pose (cm/°) | Known fiducial objects, 100 trials | ≤2 cm / ≤2° | ≤1 cm / ≤1° | 6 | |
Locomotion Stability | Footstep error (cm) | Flat & uneven (foam, ramps) | ≤5 / ≤8 | ≤3 / ≤5 | 10 |
Disturbance recovery (s) | 30 N·s lateral push, mid-stance | ≤1.0 | ≤0.6 | 8 | |
Falls per 10 min | Mixed flooring course | 0 | 0 | 6 | |
Manipulation Proficiency | Grasp success (%) | 10 household objects × 10 each | ≥85 | ≥95 | 10 |
Task time (s) | Pick-place-open drawer sequence | ≤60 | ≤40 | 6 | |
Adaptability (Domain Shift) | Friction change success (%) | μ≈0.2 (vinyl) ↔ μ≈0.7 (rubber) | ≥80 | ≥95 | 8 |
Payload adaptation (kg) | Add 5–10 kg backpack | ≥80% tasks succeed | No perf loss vs. baseline | 6 | |
Lighting robustness (%) | 200→20 lux, occlusions | ≥80 | ≥95 | 4 | |
Safety & HRI | Contact force cap (N) | Soft bump into torso/arm | ≤150 N | ≤100 N | 8 |
E-stop reaction (ms) | Instrumented stop | ≤100 | ≤60 | 6 | |
Proxemic compliance | 1 m bubble; auto-yield | Always | Smooth + anticipatory | 4 | |
Energy & Thermal | Cost of transport / duty | 0.8 m·s⁻¹ walk, 10 min | CoT ≤2.5; no throttling | CoT ≤2.0; cool headroom | 4 |
Reliability & Ops | Crash-free runtime (h) | Continuous mixed tasks | ≥4 | ≥8 | 6 |
MTTR (min) | From fault to resume | ≤10 | ≤5 | 4 | |
Data/MLOps Quality | Log completeness (%) | Fields present & synced | ≥95 | 100 | 4 |
Red-team/Stress Add-ons (unscored but required sign-off)
- Wet floor patch; loose cable on the ground; reflective surfaces; crowded corridor with moving people; unexpected bump during grasp.
- Tool changeover (e.g., gripper pad swap) without retune; firmware rollback test; dataset/model version mismatch drill.
Pass/Fail Guidance
- Greenlight pilot if every category ≥3 and weighted overall ≥3.5, with no Safety/HRI item <3.
- Limited pilot if overall ≥3 and all Safety/HRI ≥3 but ≤2 items in other categories may be 2 (with corrective plan).
- Hold otherwise.
Artifacts to Collect
- Synced logs (sensors, state, actions), per-trial CSV (success, times, errors), annotated video, and incident reports.
- Brief post-mortem for any score ≤2: root cause, patch, and re-test date.