Learn how to manage and monitor Linux processes. Process signals, priorities, and monitoring tools.
Process management is essential for Linux administration. This guide covers process control and monitoring.
# List processes
ps aux
ps -ef
# Real-time monitoring
top
htop
# Process tree
pstree
# Find process by name
pgrep nginx
pidof nginx
# Send signal
kill -SIGTERM 1234
kill -9 1234 # Force kill
# Kill by name
pkill nginx
killall nginx
# Graceful shutdown
kill -SIGTERM $(pidof nginx)
# Change priority
nice -n 10 command
renice 10 -p 1234
# Real-time priority
chrt -f 50 command
# Run in background
command &
# Bring to foreground
fg
# Detach with nohup
nohup command &
# Use screen/tmux
screen -S session
tmux new -s session
# Process resources
ps aux --sort=-%mem
ps aux --sort=-%cpu
# System resources
vmstat 1
iostat 1
Manage processes using ps, kill, and nice. Monitor with top, htop, and system monitoring tools.
For Process Management and Monitoring in Linux, define pre-deploy checks, rollout gates, and rollback triggers before release. Track p95 latency, error rate, and cost per request for at least 24 hours after deployment. If the trend regresses from baseline, revert quickly and document the decision in the runbook.
Keep the operating model simple under pressure: one owner per change, one decision channel, and clear stop conditions. Review alert quality regularly to remove noise and ensure on-call engineers can distinguish urgent failures from routine variance.
Repeatability is the goal. Convert successful interventions into standard operating procedures and version them in the repository so future responders can execute the same flow without ambiguity.
For Process Management and Monitoring in Linux, define pre-deploy checks, rollout gates, and rollback triggers before release. Track p95 latency, error rate, and cost per request for at least 24 hours after deployment. If the trend regresses from baseline, revert quickly and document the decision in the runbook.
Keep the operating model simple under pressure: one owner per change, one decision channel, and clear stop conditions. Review alert quality regularly to remove noise and ensure on-call engineers can distinguish urgent failures from routine variance.
Repeatability is the goal. Convert successful interventions into standard operating procedures and version them in the repository so future responders can execute the same flow without ambiguity.
For Process Management and Monitoring in Linux, define pre-deploy checks, rollout gates, and rollback triggers before release. Track p95 latency, error rate, and cost per request for at least 24 hours after deployment. If the trend regresses from baseline, revert quickly and document the decision in the runbook.
Keep the operating model simple under pressure: one owner per change, one decision channel, and clear stop conditions. Review alert quality regularly to remove noise and ensure on-call engineers can distinguish urgent failures from routine variance.
Repeatability is the goal. Convert successful interventions into standard operating procedures and version them in the repository so future responders can execute the same flow without ambiguity.
For Process Management and Monitoring in Linux, define pre-deploy checks, rollout gates, and rollback triggers before release. Track p95 latency, error rate, and cost per request for at least 24 hours after deployment. If the trend regresses from baseline, revert quickly and document the decision in the runbook.
Keep the operating model simple under pressure: one owner per change, one decision channel, and clear stop conditions. Review alert quality regularly to remove noise and ensure on-call engineers can distinguish urgent failures from routine variance.
Repeatability is the goal. Convert successful interventions into standard operating procedures and version them in the repository so future responders can execute the same flow without ambiguity.
LLM Gateway Design for Multi-Provider Inference. Practical guidance for reliable, scalable platform operations.
How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.
Explore more articles in this category
Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.
Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.
Concrete systemd unit patterns that reduced flakiness: restart policies, resource limits, and structured logs.