Botnet Detection Using Passive DNS
نویسنده
چکیده
The Domain Name System (DNS) is a distributed naming system fundamental for the normal operation of the Internet. It provides a mapping between user-friendly domain names and IP addresses. Cyber criminals use the flexibility provided by the DNS to deploy certain techniques that allow them to hide the Command and Control (CnC) servers used to manage their botnets and frustrate the detection efforts. Passive DNS (pDNS) data allows us to analyse the DNS history of a given domain name. Such is achieved by passively collecting DNS queries and the respective answers that can then be stored and easily queried. By analyzing pDNS data, one can try to follow the traces left by such techniques and be able to identify the real addresses of the botnet Command and Control servers. For instance, we expect malware-related domain names to have lower Time-to-Live (TTL) values than legitimate and benign domains. The aim of this research is the development of a proof-of-concept able to automatically analyze and identify botnet activity using pDNS data. We propose the use of machine learning techniques and devise a set of 36 different features to be used in the classification process. With two weeks of pDNS data we were able to set up, create and test different classifiers, namely k-Nearest Neighbours (kNN), Decision Trees and Random Forests. Using all-purpose blacklists we were able to achieve an accuracy of 97%, having a False Positive Rate (FPR) of 3%. However, with only two weeks of data it is not possible to find sufficient domain names used for botnet CnC servers such that we are able to extract statistically significant results. Furthermore, some of our initial assumptions hold when analysing botnet-related domain names but do not for malware-related domain names. For instance, the average TTL value for legitimate domain names is twice lower than for malware-related domain names. We believe this is due to the fact that only a small portion of our blacklist is composed of botnet-related domain names that have small TTL values. In addition, many legitimate domain names make use small values of TTL possibly to increase the availability of the services provided. Related work such as Notos [2], EXPOSURE [10] and Kopis [3] reported similar accuracy levels, however with lower FPRs. This might be due to the fact that while our feature set is extracted solely from pDNS data, such systems include also WHOIS and Autonomous System (AS) data. This data is also useful for detection of malware-related domain names and it contributes to build more accurate and precise systems. [
منابع مشابه
Mentor: Positive DNS Reputation to Skim-Off Benign Domains in Botnet C&C Blacklists
The Domain Name System (DNS) is an essential infrastructure service on the internet. It provides a worldwide mapping between easily memorizable domain names and numerical IP addresses. Today, legitimate users and malicious applications use this service to locate content on the internet. Yet botnets increasingly rely on DNS to connect to their command and control servers. A widespread approach t...
متن کاملDetection of fast - ux botnets through DNS tra c analysis
Botnets are networks built up of a large number of bot computers, which provide the attacker with massive resources, such as bandwidth, storage, and processing power, in turn, allowing the attacker to launch massive attacks, such as Distributed Denial of Service (DDoS) attacks, or undertake spamming or phishing campaigns. One of the main approaches for botnet detection is based on monitoring an...
متن کاملBotnet Detection Through Fine Flow Classification
The prevalence of botnets, which is defined as a group of infected machines, have become the predominant factor among all the internet malicious attacks such as DDoS, Spam, and Click fraud. The number of botnets is steadily increasing, and the characteristic C&C channels have evolved from IRC to HTTP, FTP, and DNS, etc., and from the centralized structure to P2P and Fast Flux Network Services. ...
متن کاملDetecting Botnet Activities Based on Abnormal DNS traffic
The botnet is considered as a critical issue of the Internet due to its fast growing mechanism and affect. Recently, Botnets have utilized the DNS and query DNS server just like any legitimate hosts. In this case, it is difficult to distinguish between the legitimate DNS traffic and illegitimate DNS traffic. It is important to build a suitable solution for botnet detection in the DNS traffic an...
متن کاملMonitoring of the DNS Infrastructure for Proactive Botnet Detection
Botnets enable many cyber-criminal activities, such as DDoS attacks, banking fraud and cyberespionage. Botmasters use various techniques to create, maintain and hide their complex C&C infrastructures. First, they use P2P techniques and domain fast-flux to increase the resilience against take-down actions. Second, botnets encrypt their communication payload to prevent signature based detection. ...
متن کامل