Thursday, September 10, 2015

Cyber physical attacks to critical infrastructure (Part III: Detection technologies)


Network intrusion detection System (NIDS)


According to the taxonomy of intrusion detection systems defined by Debar and its working group, the most suitable  System is shown in the following figure:




The detection method should not be based on signatures since it should be frequently updated and it does not offer protection against 0-day vulnerabilities, making detection behavior as the most appropriate choice.
The behavior detection should be passive to be as non-intrusive as possible in the network and not interfere with the commands and actions that are exchanged over the network.
Given the importance of the transitions  have been in the control of industrial processes, the NIDS should consider this type of paradigm, and finally should be monitored continuously since these networks are operating in 24x7x365 basis.

Regarding detection technology for behavioral anomalies, there are several alternatives: inspection message headers (headers) detection, inspection message payload (Payload) detection or a combination of both. In the present note we will use the last option as it is the only one capable of detecting this type of semantic attacks and is used by the deep protocol behavior inspection technology we propose as network intrusion detection in critical infrastructure.

 

NIDS based on deep protocol behavior inspection


Once selected detection technology we will explain how to implement it in such environments. Since its operation is based on detecting events that differ from the normal behavior (anomalies), we must first build the pattern (behavioral blueprint).

The construction of this pattern can be performed on a specific-based manner (introducing the topological and operational information network) or unattended using learning-based technology. The first option is rarely useful as the knowledge of low-level details in the implementation of control networks organizations own is in many cases dating back to the FAT (Factory Acceptance Test) or the SAT (Site Acceptance Test), so usually very old information being outdated and not maintained systematically through change management procedures in line with best practices.

Selecting unattended construction method by learning, we must remember that it is very important that this normal behavior pattern is built in an environment as similar as possible to the production environment on which detecting anomalous behavior is performed.

The scheme of operation of this type of intrusion detection sensors is as follows:


Although learning is automatic it must always be adjusted by control engineers who are familiar with the process to eliminate any undesired operation generated by unscheduled interventions once verified by the control personnel. Additionally, in the phase detection such events should be able to be included in the pattern of behavior (Blueprint)  to avoid unwanted alerts (false positives).

The behavioral blueprint obtained after the learning and customization phase includes the following elements:

Control Network Communication profile

At this time the NIDS knows every possible tuple in the control network (traffic matrix):


Src IP,Src   Port -> Dest. IP,Dest Port



From this moment, we can be alerted by:

• New devices on the network
• Devices trying to connect to our network that are not in our Model
• Devices sending information out of our network to devices out of the model.

Protocols, messages and values matrix

In order to detect advanced operation issues or attack to processes we need to use the technology of deep protocol behavior inspection (DPBI), since with this we will know:

·      The control protocols operating in the network
·      Messages that are used within each protocol
·      The distribution of values within each message field of actual network control protocols.

All this information must be organized in a logical manner in order to obtain the pattern of behavior which subsequently compares all messages obtained from the network. The DPBI NIDS is responsible for generating this model during the learning phase using its advanced technology on behavior modelling.

From this point we can start the detection phase and be alerted of any communication diverge from the newly built behavioral blueprint.

Operational Correlation


Despite the power detection technology DPBI control environments, we need to be able to generate alerts to detect cyber attacks on physical process (operations that are within the behavior pattern and executed from the control network stations also found in the pattern.).

A clear example of this would be a kind Aurora attack and run from a SCADA server to transmit orders for opening and closing of switches out of phase to a remote unit (RTU) in a substation, using the IEC 104 protocol.
To detect this cyber attack, we should be able to store all IEC 104 opening and closing aimed at RTU we found in the control network and estimate the time difference on the immediately preceding command sent to the RTU messages.
To do this the network intrusion detector DPBI also be able to provide the functionality described above. (Operational correlation).

In the case of the NIDS DPBI solution for SCADA SCAB (Security Awareness Control Box for SCADA), this correlation is implemented by deploying additional logic (script type program) that makes this correlation.

An example of a function of this script is as follows:

function new_connection_data(conn, data, is_upstream)
    local record = find_flow(conn)
    if record ~= nil then
        record.up_bytes = conn:upstream_num_bytes()
        record.down_bytes = conn:downstream_num_bytes()
        record.up_pkts = conn:upstream_num_pkts()
        record.down_pkts = conn:downstream_num_pkts()
        record.payload_up_bytes = conn:upstream_num_payload_bytes()
        record.payload_down_bytes = conn:downstream_num_payload_bytes()
    end     
end

Future trends: S-IDS


The combination of detection technology based on control protocol behavioral anomalies, together with the operational correlation allows us to detect cyber-physical attacks on critical infrastructure processes, yet are somewhat craft in regard to the implementation operational and temporal correlations.

To solve this problem it is being investigated in new detection technologies that includes this information in the behavioral pattern automatically.
One of this technology is called Sequence-aware Intrusion Detection System and raises a number of novel approaches in generating a behavior pattern, such as control of the order in which messages are sent and received to the Control elements from the servers, the time between state transitions and sending messages and standard deviation of the time.

The block architecture of a system of this type would be:


In the learning phase information from sources model input (control network protocols messages, log file entries and values of the commands of the process) would be collected and would feed the sequencer to maintain timing trace, before passing to process model generator.

As in the case of NIDS DPBI based, once the learning phase is finished would enter in detection mode. First experimental results for SCADA Waters sector have been achieved and work is in progress to decrease false positive rate (FPR) and noise reduction for the detection phase.

This is just one of today research paths on intrusion detection for industrial control system, but still is under development and validation.

Wednesday, September 9, 2015

Cyber physical attacks to critical infrastructure (Part IV: Detecting Attacks)

Detection of cyber-physical attacks


All cyber-physical attacks exposed earlier in this technical note can be detected using a combination of technologies in network intrusion detection as deep protocol behavior inspection (DPBI) and operational correlation.

  •       Aurora Attack type: After creating the DPBI pattern of normal behavior for the control network, a script that would monitor the sequence of write commands received by the RTUs in an arbitrary period of time (seconds or milliseconds) would be deployed. In the event that an order of writing CLOSE was sent to a given RTU with a previous OPEN value received, at a lower time than the allowed time interval (0.2s), we would fire an alert.


  • Water hammer / discharges Attack Type: Assuming a scenario of progressive control as in Figure 2, would only be possible to reach the completely closed (or open) state for the valve from a previous state with V = 30.



Any value sent in a write command to the PLC control valves would be compared to the last write value sent. If the difference between the value of writing command and the immediately preceding received exceeded the maximum increase in programmed control (∆V = 10), an alert would skyrocket.
Additionally, all values in a command not included in the behavioral blueprint would trigger an alert. (Eg V> 40)

Remarkably, the importance of the anomaly differs depending on the detected transition and a criticality hierarchy may be established. In the example of Figure 2, the abnormal transition E3 -> E5 trigger an alert warning, while the transition anomaly E1 -> E5 trigger a critical alert.

·      Alteration of the amount of production (vinyl acetate monomer): Any value received in the write message on the PLC that controls the temperature of the reactor outside the distribution of values of the behavior blueprint would trigger an alert.

·      Attack by temperature to chemical reactors: As in the case of water hammer, any write command sent to the PLC progressive temperature control would be compared with the immediately preceding. If the difference between the value of writing and the immediately preceding received exceed the maximum temperature defined threshold, an alert would be sent


·      Fake maintenance: Send commands to the control elements in order to conceal attacks on process never would have formed part of the original behavior pattern built for the network, so any transmission of those would trigger an immediate alert.

We can summarize this in the following table:



It is important to note that the semantics needed to detect these attacks through additional programming logic comes from the deep knowledge of the processes controls and possible weaknesses of them. Based solely on deep protocol inspection (DPI) systems could not detect such attacks and it is necessary to use both DPBI and Operational correlation to detect them all.

There is another very powerful implementation of the operational correlation in detecting how allowed control operations (nodes, protocols and distribution of values) are executed on specific time frames. (A firmware update of a PLC or RTU can be normal within one business day and exceptional if done on weekends or at night).


Conclusions


The new attacks on the cyber-physical systems of industrial processes running on critical infrastructure, require the adoption of new strategies capable of detecting without interfering with normal operation.

The change in the functional structures (common Managers and multidisciplinary teams) and the procedures at critical infrastructure operators (Risk Analysis and procurement requirements), it is imperative to address this kind of physical attacks.

The only technology capable of detecting attacks from within the control network using protocols, messages and values ​​allowed within the same, but in order or frequency other than normal is the use of intrusion detection systems that support the deep protocol behavior inspection (DPBI) with the ability to implement correlation of operational events.

The implementation of these technologies in critical infrastructures control networks should be considered seriously by those responsible for the cyber security of these facilities and the authorities responsible for monitoring compliance with the PIC 8 / 2011 Act.


In the future, Sequence-aware NIDS (S-NIDS), or similar technologies, may help simplify the implementation of these systems in control networks significantly improving the behavior pattern generation and subsequent maintenance, protecting processes and cyber-physical systems on critical infrastructures.

Friday, July 3, 2015

New Detection Technologies: Ouija 2.0 takes care of Critical Infrastructure



On my last post in the CCI Blog, I described how American Water Works Association Cyber Security Framework didn’t addressed the detection Cyber Security events measures for these control systems.
Today I have seen a good tweet (as always) from our colleague Joel Langill (@SCADAhacker)  advising about a new Tripwire studyon Critical Infrastructure.  In their study they stated than “86% energy security professionals believe they can detect a breach on critical systems in less than 1 week, and suddenly I remembered the last CCI report on Industrial Cyber Security for Spain in 2015. According to that report, these were the Cyber Security solutions deployed on those Networks:


As you can see, SIEM, log correlation, IDS and IPS technologies are not very popular on such organizations, so questions are:
  •        Who has responded in the Tripwire survey? (Surely not Spanish Industrial companies)
  •         If you are not monitoring cyber security events, how can you detect them?
  •        Why Duqu 2.0 has been hitting ?
The only rational response to this astonishing detection rate of 86%  is …….  Ouija !!!! (Of course, version 2.0 with snmp and syslog support)

Monday, June 29, 2015

The Dragonfly campaign hangover in Spain - Almost one year after

We have good news !!!!.
After almost one year, we have less Industrial Control Systems open to Internet in Spain.

It has been a long time campaign, but may be someone has got the message and we have contributed with an awareness improvement.

Let's see how things are going in the next month.
Remember, always close the door to your SCADA !!!

Wednesday, June 24, 2015

Finding the needle in the Water Tank (At least, you should look for it)



Water Management Cyber Security guide from the American Water Works Association is bypassing certain critical controls on ICS Networks. 


Are we still in time in Spain to avoid this mishap? 


I know I should be looking for the needle in the Haystack, but looking for that in a Waste Water Tank is not easy either. When you have to protect Drinking & Waste Water control networks plants you should be aligned with the best practices and be national regulation complaint.

Since February 2014, EEUU deployed its Cyber Security Framework from the National Institute of Standards and Technology (NIST). In that General Framework, detection of behavioral anomalies, is recognized in the third function defined by the Framework: Detect.

Under the function Detect (DE) is the category of Anomalies and events (AE), and under this, there is one sub management categories established:

· DE.AE-1: A baseline of network operations and expected data flows for users and system is established and managed.

(The need to detect anomalies in control networks is found in the following standards: COBIT 5 DSS03.01, ISA 62443-2-1:2009 4.4.3.3 and NIST SP 800-53 Rev. 4 AC-4, AC-3, CM-2, SI-4)

   


























In the Water sector and in 2014 too, AWWA deployed another Cyber Security Framework (Process Control System Security Guidance for the Water Sector). In this guide there was a Cross Reference to NIST Cybersecurity Framework, and the only two categories not addressed in this cross reference were related with cyber security events detection!!! 

   















Why is this? I don’t really know, but what I do know is that continuous security monitoring is the only way to detect any threat in your network and managed risk in a proper way. 

Now we are going to have our water sector regulation in Spain but … Are we going to forget about anomalies detection in our permanent security measures too? 

If so, we will never find the needle (or the virus) in our water tanks.

Tuesday, March 3, 2015

2015 Mission: Simplify Cyber Security (No more excuses, Mr CEO)

I've been in this for years and it seems like I am always living my first projects as security auditor. It is true that technology issues on the "Red Team" have improved a lot: Bad USB and other Firmware attacks, Air Gap attacks based on cyber physical systems, Ultra advanced malware evading 99% of the AV solutions, etc. But, What about the "Blue Team" (Organizations)?

In the last HP Cyber Security Report for 2015, you can read the following:

  • 44% of known breaches in 2014 came from vulnerabilities that are between two and four years old
  • Misconfigurations of servers as the top vulnerability in 2014
This is really a good (bad) indicator about Cyber Security awareness in organizations.
But, Why is this happening?.

If we look at other important report for 2014 from Information Week - Darkreading, we can find that most of the organizations surveyed have declared Complexity as the main concern for last year:



 1 Million Dollar question is: How they can thing patching and right configuring (hardening) is complex?

My only response to this question is they don't have security or even IT staff enough to apply the right procedures in the right manner. But if so, another question arises: Why are they not contracting Security Managed Services?
May be Mr CEO thinks this is very expensive and complex, but I will try to explain my approach to him in the next posts.