Disk Full Error
Recovering from a Disk Full error
If the disk used by Trisul fills up to 100%, you may not be able to access the web interface, and Trisul itself will stop working. To recover from this error, follow these steps.
-
Step 1: Mount new partition
Create a new partition with enough space and mount it, say on /mnt/trisul_extra
We will be moving some packet capture and metrics data to this new area to free up some space.
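As a rough sketch of Step 1, the helper below formats a new partition and mounts it. The function name prepare_extra_volume and the device name /dev/sdb1 are placeholders chosen for illustration, not part of Trisul; the use of mkfs.xfs assumes the xfsprogs package is installed. Note that formatting destroys any data already on the device, so double-check the device name first.

```shell
# Sketch only: the device is a placeholder for your new, empty partition.
# mkfs.xfs wipes whatever is on that device.
prepare_extra_volume() {
    device="$1"                              # e.g. /dev/sdb1 (assumption)
    mount_point="${2:-/mnt/trisul_extra}"    # mount point used in this guide

    # Refuse to continue unless the argument really is a block device.
    if [ ! -b "$device" ]; then
        echo "not a block device: $device" >&2
        return 1
    fi

    mkfs.xfs "$device" || return 1           # create the filesystem
    mkdir -p "$mount_point" || return 1      # create the mount point
    mount "$device" "$mount_point" || return 1
    df -h "$mount_point"                     # show the new free space
}

# Usage (as root): prepare_extra_volume /dev/sdb1 /mnt/trisul_extra
```

To make the mount survive a reboot you would also add an entry to /etc/fstab; that is left out here because the right entry depends on your system.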
-
Step 2: Move the Probe archive to the new volume
The idea here is to move the packet capture archive directory to the new volume and create a soft link in the old location that points to it.
cd /usr/local/var/lib/trisul-probe/domain0/probe0/context0/caps/
mv archive /mnt/trisul_extra/probe_archive
ln -sf /mnt/trisul_extra/probe_archive archive
-
Step 3: Check if enough disk space is freed up and repeat
Use the df command to check whether enough disk space is now available. If it is not, repeat Step 2 for the /ref directory and then, if required, for the /oper directory.
At this step, if disk usage drops to about 70-80%, you can stop here and skip Step 4. You should be able to restart the Web Server and then the other Trisul components.
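The check in Step 3 can be scripted along these lines. This is a minimal sketch: the function names disk_used_percent and disk_ok are made up for this example, and the default threshold of 80 is an assumption taken from the 70-80% guideline above.

```shell
# Print the usage percentage (digits only) of the filesystem holding a path.
disk_used_percent() {
    # -P forces the POSIX output format: one line per filesystem,
    # with the capacity percentage in column 5.
    df -P "$1" | awk 'NR == 2 { sub(/%/, "", $5); print $5 }'
}

# Succeed when usage is at or below the threshold (default 80).
disk_ok() {
    used=$(disk_used_percent "$1")
    [ -n "$used" ] && [ "$used" -le "${2:-80}" ]
}

# Usage:
#   disk_ok /usr/local/var/lib/trisul-probe 80 \
#       && echo "enough space freed, Step 4 can be skipped" \
#       || echo "still too full, move the next directory"
```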
-
Step 4: Move the Hub archive to the new volume
As with the Probe node, we want to move the Hub data to the new partition and create a soft link that points to it.
cd /usr/local/var/lib/trisul-hub/domain0/hub0/context0/meters/
mv archive /mnt/trisul_extra/hub_archive
ln -sf /mnt/trisul_extra/hub_archive archive
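The move-and-link pattern used in Steps 2 and 4 could be wrapped in a small helper like the one below. This is only a sketch; relocate_dir is a hypothetical name, not a Trisul tool. It adds two guards the bare commands lack: it refuses to run if the destination already exists (mv would otherwise move the directory inside it), and it does nothing if the source is already a symlink.

```shell
# relocate_dir SRC DEST: move directory SRC to DEST and leave a symlink at SRC.
# DEST should be an absolute path so the symlink resolves from anywhere.
relocate_dir() {
    src="${1%/}"    # strip a trailing slash so the symlink name is valid
    dest="$2"

    if [ -L "$src" ]; then
        echo "already a symlink, nothing to do: $src" >&2
        return 0
    fi
    if [ ! -d "$src" ]; then
        echo "not a directory: $src" >&2
        return 1
    fi
    if [ -e "$dest" ]; then
        echo "destination already exists: $dest" >&2
        return 1
    fi

    mv "$src" "$dest" || return 1
    ln -s "$dest" "$src"
}

# Usage, with the paths from this guide:
#   relocate_dir /usr/local/var/lib/trisul-hub/domain0/hub0/context0/meters/archive \
#                /mnt/trisul_extra/hub_archive
```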
Usually at this point you should have enough disk space freed up. Now you need to tune the SlicePolicy in both the Hub and Probe configuration files so the disks don't fill up again.
References
- trisulProbeConfig.xml: for setting the maximum retention of pcaps in GB.
- trisulHubConfig.xml: for setting the number of days of metrics to retain.
Best practices
Here are some best practices for large deployments.
- Run Trisul for 3-4 days to get an idea of data growth per day. Use that information to size the disks.
- Prefer the XFS filesystem over EXT4 due to the large number of inodes supported
- Consider using LVM so you can easily expand a filled-up volume by adding new disk capacity
- Prefer RAID-0 for the Probe PCAPs for higher write performance
- Prefer RAID-5 for the Hub for resilience
- For large enterprises, mount the /archive directory on your NAS if you have one
- Try to use PCAP pruning rules, e.g. don't store Netflix or YouTube video packets
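For the first point above, the sizing arithmetic can be sketched as follows. The function names and the sample numbers are illustrative only, and the 80% fill target is an assumption, not a Trisul recommendation. All values are whole gigabytes.

```shell
# retention_days USABLE_GB GROWTH_GB_PER_DAY
# Whole days of data that fit on the disk at the observed growth rate.
retention_days() {
    echo $(( $1 / $2 ))
}

# disk_needed_gb GROWTH_GB_PER_DAY DAYS TARGET_FILL_PERCENT
# Disk size needed to hold DAYS of data while staying at or below the
# target fill level, rounded up to the next whole GB.
disk_needed_gb() {
    echo $(( ($1 * $2 * 100 + $3 - 1) / $3 ))
}

# Usage: with 50 GB/day of observed growth,
#   retention_days 2000 50     prints 40
#   disk_needed_gb 50 30 80    prints 1875
```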