Machine learning and Data mining in Home Automation


Home automation and the Internet of Things (IoT) are gaining popularity, and automatic systems are increasingly preferred over manual ones. A smart home automation system aims to make the user's life easier. IoT deployments contain a large number of sensors that monitor or control objects, and these sensors generate a huge amount of data. This paper discusses how machine learning and data mining can be used to turn a conventional home automation system into a smart home automation system. Users of the system can control their home from a smartphone, a computer, or any other device connected to the internet. The system also interacts with the user by offering suggestions (using a prediction algorithm), reminders, and alerts.


In recent years, consumer interest in smart home automation has grown. Various home automation systems are now available on the market that let users control their home remotely with a single on or off command. Current research focuses more on the creation of the smart home, a home that is able to control itself and make decisions on its own. This study first introduces IoT (Internet of Things) and machine learning, both of which are growing rapidly. IoT systems contain a huge number of sensors that can measure temperature, light, noise, distance, pressure, motion, speed, and so on. These sensors produce a large amount of data, and here we discuss how this data can be used to provide more automation. Effective and reliable solutions, based on big data and cloud platforms, are needed for storing the large amount of data the sensors produce. We believe that this data contains highly useful and valuable information, and that not much attention has been paid to analyzing it to make the home smart. In our study, we investigate how data mining and machine learning techniques can play an important role in finding appropriate patterns and transforming them into something valuable for the end user of our system. We use this data to predict the user's next step, provide useful suggestions, and offer more automation.


Storing data in a consistent format makes it easier to analyze with data analytics techniques. The main analysis functions can be grouped into the following level-wise categories:

· Visualization

· Statistics

· Data mining

· Machine learning

Functional Architecture

The figure shows the first-level functional architecture of our system.

As the figure shows, the functional architecture is divided into three main parts: user interface (UI), data analytics, and data management.

Data Management

The data management layer is the most important part of this functional architecture. It manages all data produced directly or indirectly by the sensors: it collects the sensor data and then stores it in the respective databases or cloud platforms. This data is then used by the data analytics layer.

Data Analytics

The data analytics layer is responsible for analyzing the data produced by the sensors and presenting it in an understandable format or graphical representation. In this layer, we find patterns in past data, compare them with earlier analytical results, and produce predictions of the end user's next actions.

User Interface (UI)

In our architecture, there are two types of user interface: the end-user interface and the analysis interface. The end-user interface contains all functionality operated directly by users; for example, a user can turn any fan or light on or off through this interface. The analysis interface provides all analytical data produced from the sensors, together with its graphical representation. The system's settings are also configured through this interface.


This figure shows the flow of the data processing architecture. First, we collect data from the physical world using different types of sensors; for example, an LDR measures light intensity and an LM35 measures temperature. In the second step, we preprocess that data and convert it into meaningful information. Machine learning and data mining then use this information to understand the user's activity and find appropriate patterns. Lastly, the system makes a decision that takes all parameters into account.


Data collection

The data collection layer provides the functionality to accumulate any data produced by devices present in the environment. Sensors and devices push their data into the database. For instance, in the smart home automation solution they send event messages containing their identifier, the type of data they produce, and the value of the data. An event is emitted whenever the measured value has changed since the last emission.
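As a minimal sketch of this emit-on-change behaviour, the following Python fragment models an event message with an identifier, a data type, and a value, and only reports a reading when it differs from the last one sent. The class names, field names, and the device name `lm35-livingroom` are our own assumptions for illustration; they are not taken from any particular device protocol.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SensorEvent:
    """One event message: who sent it, what kind of data, and the value."""
    device_id: str
    data_type: str
    value: float


class ChangeReporter:
    """Hypothetical helper that emits an event only when the value changes."""

    def __init__(self, device_id: str, data_type: str) -> None:
        self.device_id = device_id
        self.data_type = data_type
        self._last_sent: Optional[float] = None

    def read(self, value: float) -> Optional[SensorEvent]:
        # Suppress the reading if it equals the last value we emitted.
        if value == self._last_sent:
            return None
        self._last_sent = value
        return SensorEvent(self.device_id, self.data_type, value)


# Example: five raw readings, of which only the changed ones become events.
reporter = ChangeReporter("lm35-livingroom", "temperature")
events = []
for reading in [21.0, 21.0, 21.5, 21.5, 22.0]:
    event = reporter.read(reading)
    if event is not None:
        events.append(event)
```

A real sensor would probably use a tolerance rather than exact equality, so that measurement noise does not trigger an event on every reading.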

Data Storage

Because we want to run data mining and machine learning algorithms on the collected data, we need to store it in a dedicated database or cloud platform. Depending on the volume of data to be stored, we might need a big data storage solution or a standard relational database. An IoT system contains a huge number of sensors, and in home automation every event issued by a device and every sensor reading has to be stored.
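For the small-volume case, a standard relational database is enough. The sketch below uses Python's built-in `sqlite3` module to store events and read them back per device. The table layout, the column names, and the helper functions `open_store`, `store_event`, and `readings_for` are assumptions made for this example, not a schema prescribed by our system.

```python
import sqlite3


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the event store; ':memory:' keeps it in RAM."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS events (
               id INTEGER PRIMARY KEY AUTOINCREMENT,
               device_id TEXT NOT NULL,
               data_type TEXT NOT NULL,
               value REAL NOT NULL,
               recorded_at TEXT NOT NULL
           )"""
    )
    return conn


def store_event(conn, device_id, data_type, value, recorded_at):
    """Insert one event; the 'with' block commits the transaction."""
    with conn:
        conn.execute(
            "INSERT INTO events (device_id, data_type, value, recorded_at) "
            "VALUES (?, ?, ?, ?)",
            (device_id, data_type, value, recorded_at),
        )


def readings_for(conn, device_id):
    """Return (timestamp, value) pairs for one device, oldest first."""
    cursor = conn.execute(
        "SELECT recorded_at, value FROM events "
        "WHERE device_id = ? ORDER BY recorded_at",
        (device_id,),
    )
    return cursor.fetchall()


# Example with made-up readings and ISO 8601 timestamps.
store = open_store()
store_event(store, "lm35-livingroom", "temperature", 21.5, "2024-01-01T08:05:00")
store_event(store, "lm35-livingroom", "temperature", 21.0, "2024-01-01T08:00:00")
store_event(store, "ldr-hall", "light", 0.4, "2024-01-01T08:01:00")
history = readings_for(store, "lm35-livingroom")
```

ISO 8601 timestamps are stored as text here because they sort correctly as strings; a larger deployment would move the same records to a big data or cloud store.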

Data Preprocessing

Before we can apply data analytics techniques or machine learning algorithms, we first have to clean the data produced by the sensors and devices or modules. We arrange this data in a standard form that is applicable to all data produced by the different sensors, and we define which data are important and which can be ignored. The result of preprocessing is an understandable database that is used by the next layers.
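The cleaning step described above can be sketched as follows. Raw records are assumed to arrive as dictionaries with `id`, `type`, and `value` keys, and `heartbeat` is a hypothetical example of a data type we choose to ignore; both are assumptions for illustration only.

```python
from typing import Optional

# Hypothetical list of data types considered ignorable for analysis.
IGNORED_TYPES = {"heartbeat"}


def clean_record(raw: dict) -> Optional[dict]:
    """Convert one raw record to the standard form, or None if unusable."""
    data_type = str(raw.get("type", "")).strip().lower()
    if not data_type or data_type in IGNORED_TYPES:
        return None
    device_id = str(raw.get("id", "")).strip()
    if not device_id:
        return None
    try:
        value = float(raw["value"])
    except (KeyError, TypeError, ValueError):
        # Missing or non-numeric value: drop the record.
        return None
    return {"device_id": device_id, "data_type": data_type, "value": value}


def preprocess(raw_records):
    """Keep only the records that could be converted to the standard form."""
    cleaned = (clean_record(record) for record in raw_records)
    return [record for record in cleaned if record is not None]


# Example with made-up raw input of mixed quality.
raw_input = [
    {"id": "lm35-1", "type": "Temperature ", "value": "21.5"},
    {"id": "lm35-1", "type": "heartbeat", "value": 1},
    {"id": "ldr-1", "type": "light", "value": "n/a"},
    {"id": "", "type": "light", "value": 0.3},
    {"id": "ldr-1", "type": "light", "value": 0.7},
]
clean_rows = preprocess(raw_input)
```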

Data visualization

Visualizing data is an efficient and powerful way for the end user, as well as for the service provider, to get quick insight into the health status of the devices and into the activities taking place in the home, provided, of course, that all devices operate correctly.

Data mining

Data mining is the process of finding meaningful patterns in a large data repository. One data mining task aims at discovering frequent patterns in a sequence of events. The benefit of identifying frequent patterns is that we can learn the daily routine of the user. This approach is useful for identifying and predicting user activity.
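A very simple form of frequent-pattern discovery can be sketched by counting contiguous runs of events in a log. This is an illustrative stand-in rather than a full sequential pattern mining algorithm, and the event names and the `min_support` threshold are assumptions for the example.

```python
from collections import Counter


def frequent_subsequences(events, length=2, min_support=2):
    """Count contiguous event runs of the given length and keep the
    ones that occur at least min_support times."""
    windows = (
        tuple(events[i:i + length])
        for i in range(len(events) - length + 1)
    )
    counts = Counter(windows)
    return {
        pattern: count
        for pattern, count in counts.items()
        if count >= min_support
    }


# Example event log with made-up event names.
event_log = [
    "door_open", "light_on", "tv_on", "light_off",
    "door_open", "light_on", "fan_on",
    "door_open", "light_on", "tv_on",
]
patterns = frequent_subsequences(event_log, length=2, min_support=2)
```

In this log the pair `door_open` followed by `light_on` occurs most often, which is the kind of routine the system could turn into a suggestion such as switching the light on automatically when the door opens.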

Clustering: This task aims at grouping elements from a very large set so that the elements inside the same group are more similar to each other than to elements belonging to different groups.
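To illustrate clustering, the sketch below runs a small one-dimensional k-means on the hours of the day at which a light was switched on. K-means is only one possible clustering method, chosen here for brevity, and the sample data and the deterministic choice of starting centers are assumptions for the example.

```python
def kmeans_1d(values, k, iterations=20):
    """Tiny 1-D k-means: returns the cluster centers and the groups
    assigned to them in the final pass."""
    ordered = sorted(values)
    # Deterministic start: pick k values spread evenly over the sorted data.
    centers = [
        ordered[(2 * i + 1) * len(ordered) // (2 * k)]
        for i in range(k)
    ]
    groups = [[] for _ in range(k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for value in values:
            # Assign each value to its nearest center.
            nearest = min(range(k), key=lambda i: abs(value - centers[i]))
            groups[nearest].append(value)
        # Move each center to the mean of its group (keep it if empty).
        centers = [
            sum(group) / len(group) if group else centers[i]
            for i, group in enumerate(groups)
        ]
    return centers, groups


# Example: made-up hours (24-hour clock) at which a light was switched on.
switch_on_hours = [19.0, 6.5, 20.0, 7.0, 19.5, 7.5]
centers, groups = kmeans_1d(switch_on_hours, k=2)
```

With this data the two groups correspond to a morning routine and an evening routine.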

Machine learning

Machine learning and data mining can play an important role in turning a conventional home automation system into a smart home automation system, because they help to predict user activity.

Machine learning has an objective quite similar to that of data mining, with the difference that the patterns found are executable structures that can be applied to fresh data in order to predict future data or infer new data.

Classification: This task aims at determining the category to which a given observation belongs. The approach makes use of a training set containing observations that are already correctly classified.
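As one possible illustration of classification, the sketch below uses a k-nearest-neighbours vote over a small labelled training set. The features (temperature in degrees Celsius and a normalized light level), the labels `fan_on` and `fan_off`, and the sample values are all assumptions made for this example; k-nearest neighbours is simply a convenient classifier to show, not necessarily the one a deployed system would use.

```python
import math
from collections import Counter


def knn_classify(training, observation, k=3):
    """Return the majority label among the k training observations
    closest (Euclidean distance) to the given observation."""
    by_distance = sorted(
        training,
        key=lambda item: math.dist(item[0], observation),
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Made-up training set: (temperature in C, light level 0..1) -> fan state.
training_set = [
    ((30.0, 0.8), "fan_on"),
    ((31.0, 0.6), "fan_on"),
    ((29.0, 0.7), "fan_on"),
    ((20.0, 0.5), "fan_off"),
    ((19.0, 0.2), "fan_off"),
    ((21.0, 0.4), "fan_off"),
]

warm_prediction = knn_classify(training_set, (28.5, 0.6))
cool_prediction = knn_classify(training_set, (20.5, 0.3))
```

Because the two features have very different ranges, a practical version would scale them before computing distances.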

Regression: This task is similar to classification, except that the value to be predicted is numerical instead of categorical. In the context of the smart home, a decision could be made on how to adjust the heater thermostat based on the same sensor measurements.
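The thermostat example can be sketched with ordinary least squares on a single feature. Here we assume, purely for illustration, that the preferred thermostat setpoint depends linearly on the outdoor temperature; the data points and function names are made up.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x):
    """Apply the fitted line to a new input value."""
    return slope * x + intercept


# Made-up history: outdoor temperature (C) and the setpoint the user chose (C).
outdoor_temps = [0.0, 5.0, 10.0, 15.0]
chosen_setpoints = [23.0, 22.0, 21.0, 20.0]

slope, intercept = fit_line(outdoor_temps, chosen_setpoints)
suggested_setpoint = predict(slope, intercept, 7.5)
```

The fitted line can then be used to suggest a setpoint for an outdoor temperature that has not been observed before.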


The home automation and IoT (Internet of Things) market is growing very fast, and the smart home automation concept leaves room for a wide range of further development. In this paper, we aim to provide more automation using machine learning and data mining concepts: we try to predict user activities and give appropriate suggestions to the user. Our motive is to provide a more comfortable and convenient system for the smart home user.




