Beyond The ESM Administrator Guide
Beyond The ESM Administrator Guide
Beyond The ESM Administrator Guide
Administrators Guide
Nathan Tisdale, Advanced Support Engineer
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Introduction
Nathan Tisdale
Advanced Support Engineer
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Agenda
A troubleshooting perspective:
Data flow
Oracle vs CORR-Engine
Live monitoring
Advanced Management Console
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Audience
Is this presentation for you?
ArcSight Administrator
Responsible for ensuring continuous event flow through ESM
Enough experience to be curious about Thread Dumps
Participation
Q&A
Hallway chats are welcome
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Data flow
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
ArcSight
SmartAgent
ArcSight Manager
ArcSight Web
ArcSight
SmartAgent
ArcSight Database
6
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Start-of-flow
Threads
(Normalization)
SeededJsse
Listener threads
Pre-persistor
Threads
Post-persistor
Threads
(rules engine)
Database
Event retrievals
Active channel
queries
Report queries
XCPUDMPC
Threads (Data
Monitors)
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Trend queries
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Processing stages
Event Verifier
10
Event Category
Adder
Event Asset
Resolver
Annotation
Initializer
Geo Info
Adder
Security
Event Persistor
Rules
engine
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Threat Level
Handler
Data
monitors
Event Forwarder
Symptom
Making sense of Agent State
Queues filling caused by
Database performance
Disk I/O
Slow rules engine processing
Slow Data Monitor processing
Symptom
Events Cache
STM eps < P-A eps
11
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
13
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Application logs
Manager
Oracle
<ARCSIGHT_HOME>/logs/default/*.log*
SERVER.LOG
SERVER.STD.LOG
SERVER.STATUS.LOG
SERVER.REPORT.LOG
SERVER.SQL.LOG
SERVER.LICENSE.LOG
PARTITIONMANAGER.LOG
PARTITIONARCHIVER.LOG
PARTITIONCOMPRESSER.LOG
PARTITIONSTATSUPDATER.LOG
<ORACLE_HOME>
/admin/arcsight/bdump/ALERT_<LISTENER>.LOG
/network/log/LISTENER.LOG
/network/log/SQLNET.LOG
14
CORR-Engine
/opt/arcsight/logger/current/arcsight/logger/logs/*
/opt/arcsight/logger/data/mysql/*.log*
/opt/arcsight/logger/data/pgsql/serverlog*
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Log rotation
Log files are always limited in size
10MB default
Automatic log file rotation
10 files are kept, plus the current file
Can extend logging
<ARCSIGHT_HOME>/config/server.properties
Copy settings from <ARCSIGHT_HOME>/config/server.defaults.properties [DO NOT EDIT THIS FILE]
# The maximum size of the log file before it
# will be rolled over. The size is specified
# in MB (MegaByte).
log.channel.file.property.maxsize=10MB
# The maximum number of backup files to create
# for rolling over.
log.channel.file.property.maxbackupindex=10
15
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
SERVER.LOG
Initialization messages
General progess messages
Event batch insert times
Garbage collector information
Critical warnings
Uncaught exceptions
Watchdog messages
Wrapper manages life cycle of manager processes
SERVER.STATUS.LOG
Information from Mbeans
Agent throughputs and status
Active Lists statistics
Rule and Data Monitor resource consumption
Also see manage.jsp
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
SERVER.REPORT.LOG
Needs to be enabled
Useful for Oracle DBA
SERVER.CHANNEL.LOG
Active Channel queries
PARTITION*.LOG
SERVER.PULSE.LOG
SERVER.LICENSE.LOG
License compliance per 24hrs
Approaching or exceeded limit(s)
17
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Other data
Manager
Oracle
Thread dump
Heap dump
Operating system
System logs
Performance data
Database sessions
RDA
AWR
lsinvetory
CORR-Engine
Session Waits
Core Dump
18
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Data to collect
99% of time, bottleneck found on manager
Thread Dumps
System tables
If reproduction to be performed
Logs
Agent logs
Manager logs
Oracle-based Manager
Alert Log
DB Sessions
CORR-E based Manager
Session Waits
mysql.log
19
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Collecting logs
ArcSight Sendlogs
Wizard interface allows user to easily gather:
Manager logs
Agent logs
Web logs
Console logs
Oracle Alert log
Thread Dumps
Session Waits
Output from SQL
Run from manager or console
./arcsight sendlogs
20
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
21
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Whiner messages
System alerts via email
Why
Where
Subsystem failures
Database connection problem
Event insertion times high
SSL certificate expiration
Database space shortage
Running out of space
Usually event space
Sometimes system table space
Partition manager failures
Get your DBA!
22
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Memory utilization
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Memory usage
Manager allocates memory in Java heap
Server.std.log
2006/02/22 23:22:51 | Memory Status: 765.6 MB Used, 1,014.0 MB Max
2006/02/22 23:22:52 | [Full GC
2006/02/22 23:22:58 | 797362K->471587K(1038336K), 5.9847261 secs]
Java heap is garbage collected
Server only allocates memory
Java VM reclaims unused memory automatically
Manager doesnt know how much garbage is in the heap
Reported memory usage includes garbage
24
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Memory
Young
Tenured
Minor GC
Only collects young generation
May expand to entire heap, and become a major collection
[GC 929899K->838966K(1036928K), 0.0353791 secs]
Major GC or Full GC
Collects both young generation and tenured generation
[Full GC 932135K->542955K(1036928K), 3.9721866 secs]
25
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
GC pause
Stop the world GC
When GC is happening, everything else is stopped
Pause Time
[Full GC 932135K->542955K(1036928K), 3.9721866 secs]
26
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Working set is defined as the memory that is in actual use and doesnt have any garbage.
Working set of the Manager can be found as above, immediately after a Full GC
27
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Too large
Peak performance is good, but
Full GC takes long time to finish
Manager could get killed by Wrapper for being hung for a long time
28
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Out of memory
Server will restart on out of memory errors
Check Logfu
Check CapsManager from server.status.log to check overall memory utilization by Data
Monitors, channels, Active Lists etc.
If you see a spike
Multiple memory intensive tasks at the same time?
Increase heap size
Memory leak
Memory usage keeps growing
Increasing heap size only delays the problem
Memory leak is hard to track down
Contact support
29
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Log analysis
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Exceptions
Details of application errors
Java construct encapsulates some failures
Coding errors
Transient bugs
A full stack trace is included
Shows where in the code the error occurred
Not all exceptions are equal
Misclassified or not significant impact
Sometimes related to content
31
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
32
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Delayed events
SERVER.LOG
default.com.arcsight.util.TimedRingBuffer][increment] Throwing out increment X, increment time = X,
acceptable range X - X (discarded=X)
Active channel
Gaps between Manager Receipt Time and End Time and Agent Receipt Time
Device Receipt Time
Connector Receipt Time (a.k.a. Agent Receipt Time)
Manager Receipt Time
Start Time
End Time
33
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Database connectivity
SERVER.STD.LOG
Connectivity Issues
SERVER.STD.LOG
SUBSYSTEM STATUS CHANGED
Persistence Rate
should take less than 100ms
INFO | jvm 2 | 2009/05/07 20:41:58 | (02-Pre-SecurityEventPersistor330) Persisted 100 events
in 32 ms.
INFO | jvm 1 | 2009/05/08 11:20:53 | (02-Pre-SecurityEventPersistor1) Persisted 100 events
in 3,698 ms
34
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Manager busy
SERVER.STD.LOG
Manager stops accepting events
INFO | jvm 1 | 2005/04/04 00:42:26 | WARNING: '1' agent requests REJECTED because the limit of '64'
agent threads was exceeded.
35
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Thread Dumps
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Start-of-flow
Threads
(Normalization)
SeededJsse
Listener threads
Pre-persistor
Threads
Post-persistor
Threads
(rules engine)
Database
Event retrievals
Active channel
queries
Report queries
XCPUDMPC
Threads (Data
Monitors)
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Trend queries
A Java snaphsot
Dont restart or reboot before collecting!
Why Thread Dumps
Stack trace for each thread in the VM
Many different threads
Bottleneck area usually identifiable
Session Waits or DB Sessions needed to correlate database activity
38
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Servlet engine
SeededJsseListener
Read bytes from network sockets
Convert read bytes to Java Objects Security Event Batch
Place event batches into queue for Flow 1
39
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Flow 1: Start
Start-of-flow
Vulnerability Scanner Reports
Place event batches into queue for Flow 2
40
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Flow 2: Pre-persistor
Pre-SecurityEventPersistor
41
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Flow 3: Post-persistor
Post-SecurityEventPersistor
42
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Content: Dashboards
XCPUDMPC-Thread
43
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Logfu
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Why Logfu?
Logfu is not an officially supported tool
Discerning patterns
Examines server.log, server.std.log, and server.status.log
Syntax
ArcSight logfu m noplot
-m is for manager
-noplot skipps plotting on graph
Outputs logfu.html to logs/default/Logfu_<date>/
Interesting data points
Famous Last Words Why did it die
Exception Groups Quickly identify repeating exceptions
Memory Identify growth in memory consumption
Event Insertion Is the database/disk able to keep up
45
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Memory patterns
46
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
47
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Shutdown patterns
48
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
49
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
50
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Status on demand
https://<HOST_NAME>:8443/arcsight/web/manage.jsp
Interesting Mbeans
Agent State Tracker
Specific and overall EPS for connectors
SessionManager
How many users are logged in
SubsystemStatus Tracker
Whiner
ActiveList Monitoring
Memory consumption
Channels
How many
Validating the SQL
52
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
53
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Mbean: RulesEngine
54
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Mbean: AgentStateTracker
55
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.
Thank you
Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice.