Execution Related Issues
Q1: How to troubleshoot execution timeouts?
⚠️ Four Major Causes:
- Process killed --> Resource cleanup failed
- Thread overload --> CPU usage > 90%
- Network congestion --> Sampling data delay
- Configuration error --> Timeout threshold too low
🔍 Diagnostic Tools:
bash# Check system load top -H -p <Execution Process PID> # Verify sampling delay cat /opt/AngusTester/data/exec/ExecutionID/runner.log
Q2: How to resolve startup failures?
🚫 Three Failure Scenarios:
Failure Type Symptoms Solution Node unavailable "No available nodes found" Restart execution agent service (AngusAgent) Agent exception "Connection refused" Check firewall settings for ports Script error Configuration validation failed Debug scripts and scenarios first
📍 Diagnostic Path:
Execution details page → Error logs → Check "Failure reason code"
Q3: How to monitor execution status in real time?
📊 Four-Dimensional Monitoring Dashboard:
- Execution progress --> Percentage progress bar
- Resource consumption --> CPU/Memory curves
- Sampling quality --> Data points/5 seconds
- Exception alerts --> Real-time pop-ups
🚨 Critical Thresholds:
- Memory usage > 80% → Automatic sampling rate reduction
- Progress stalled for 5 minutes → Pause execution
Q4: How to recover from a stuck execution?
🔄 Three-Level Recovery Mechanism:
- Auto-resume: Automatically restarts after 30 seconds of no response
- Manual intervention: Console → Execution monitoring → Force resume
- Log diagnosis: Analyze control logs or export execution snapshots to examine thread stacks
💡 Preventive Measures:
Enablepre-check mode
to verify resource availability in advance
Q5: How to improve execution stability?
🛡️ Five-Dimensional Reinforcement Plan:
Optimization Area Configuration Suggestion Resource isolation Limit max threads per task Timeout strategy Set timeout to 2x sampling interval Heartbeat check Status reporting every 5 seconds Resource reservation Reserve 20% system resources