Clustering Devices In An Internet Of Things
Clustering devices in an Internet of Things ('IoT'), including: receiving, by a device clustering module, a characteristic set for a device, wherein the characteristic set specifies one or more device attributes and an attribute value for each device attribute; clustering, by the device clustering module, the device into an attribute level cluster based on the one or more device attributes specified in the characteristic set for the device; and clustering, by the device clustering module, the device into a value level cluster based on the attribute value for each device attribute, wherein the value level cluster is a subset of the attribute level cluster.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The field of the invention is data processing, or, more specifically, methods, apparatus, and products for clustering devices in an Internet of Things ('IoT').
2. Description of Related Art
In an Internet of Things ('IoT), a wide variety of devices may exist. Each device may include different attributes, different capabilities, be located at different places, and so on. Without identifying common features and aspects of the many devices that make up an IoT, managing heterogeneous devices in the IoT may become difficult.
SUMMARY OF THE INVENTION
Methods, apparatus, and products for clustering devices in the Internet of Things ('IoT'), including: receiving, by a device clustering module, a characteristic set for a device, wherein the characteristic set specifies one or more device attributes and an attribute value for each device attribute; clustering, by the device clustering module, the device into an attribute level cluster based on the one or more device attributes specified in the characteristic set for the device; and clustering, by the device clustering module, the device into a value level cluster based on the attribute value for each device attribute, wherein the value level cluster is a subset of the attribute level cluster.
DETAILED DESCRIPTION OF EXAMPLE EMBODIMENTS
Example methods, apparatus, and products for clustering devices in the Internet of Things ('IoT') in accordance with the present invention are described herein. IoT refers to uniquely identifiable objects (things) and their virtual representations in an Internet-like structure. IoT can be thought of as a dynamic global network infrastructure with self configuring capabilities based on standard and interoperable communication protocols in which both physical and virtual things have identities and attributes. Such physical and virtual things can be seamlessly integrated into traditional networks. IoT can make use of radio-frequency identification ('RFID') technologies to identify and inventory each thing in the IoT. IoT can also make use of other technologies such as barcodes as well.
Example methods, apparatus, and products for clustering devices in the IoT in accordance with the present invention are described with reference to the accompanying drawings, beginning with FIG. 1. FIG. 1 sets forth a block diagram of automated computing machinery comprising an example computer (152) useful in clustering devices in an IoT according to embodiments of the present invention. The computer (152) of FIG. 1 includes at least one computer processor (156) or 'CPU' as well as random access memory (168) ('RAM') which is connected through a high speed memory bus (166) and bus adapter (158) to processor (156) and to other components of the computer (152).
Stored in RAM (168) is a device clustering module (304), a module of computer program instructions for clustering devices in an IoT in accordance with the present invention. The device clustering module 'clusters' Internet connected devices in the sense that the device clustering module (304) associates similar Internet connected devices with a particular cluster group. The device clustering module (304) can cluster Internet connected devices, for example, through the use of a database, table, or other data structure that includes an identifier of a particular device and an identifier of a particular cluster that the device is part of. Clustering similar Internet connected devices within a particular cluster group can enable a larger system to better manage a collection of heterogeneous devices by, for example, imposing usage rules and policies on the set of similar devices that are included in a cluster group, providing access control and security restrictions to the set of similar devices that are included in a cluster group, performing device configuration operations on the set of similar devices that are included in a cluster group, and so on.
The device clustering module (304) of FIG. 1 is configured to receive a characteristic set (302) for a device (300). The characteristic set (302) of FIG. 1specifies one or more device attributes and an attribute value for each device attribute. Devices may be deemed as being similar and therefore subject to clustering in dependence on attributes of the device such as, for example, the type of network connection supported by the device, the communications protocols supported by the device, the manufacturer of the device, different capabilities of the device, and so on. Such device attributes are specified in a characteristic set (302) that specifies one or more device attributes and an attribute value for each device attribute. The characteristic set (302) of FIG. 1may be embodied, for example, as a structured document such as an extensible markup language ('XML') document that includes an identifier of a particular attribute of the device (300) as well as a value for the attribute. Consider the following example of a characteristic set (302) embodied as an XML document:
<ns:deviceData xmlns:ns="urn:foo"> |
||
<ns: attribute id = "1"> |
||
<ns:name>deviceType</ns:name> |
||
<ns:value>MobilePhone</ns:value> |
||
</ns:attribute> |
||
<ns: attribute id = "2"> |
||
<ns:name>manufacturer</ns:name> |
||
<ns:value>Nokia</ns:value> |
||
</ns:attribute> |
||
<ns: attribute id = "3"> |
||
<ns:name>model</ns:name> |
||
<ns:value>N72</ns:value> |
||
</ns:attribute> |
||
</ns:deviceData> |
||
Included in the example characteristic set (302) above are three attributes and values for each attribute that describe the device that the characteristic set (302) represents. The first attribute is a 'deviceType' attribute with a value set to 'MobilePhone' which indicates that the device is a mobile phone. The second attribute is a 'manufacturer' attribute with a value set to 'Nokia' which indicates that the mobile phone is manufactured by Nokia™. The third attribute is a 'model' attribute with a value set to 'N72' which indicates the manufacturer's mobile number for the mobile phone. Readers will appreciate that the example characteristic set (302) set forth above can include a number of additional attributes and values associated with such attributes that can be used to describe additional information about the device that is associated with the characteristic set.
The device clustering module (304) of FIG. 1 is also configured to cluster the device (300) into an attribute level cluster based on the one or more device attributes specified in the characteristic set (302) for the device (300). The attribute level cluster is a cluster of devices that have similar attributes—with no consideration given to the values associated with each attribute. Clustering the device (300) into an attribute level cluster based on the one or more device attributes specified in the characteristic set (302) for the device (300) may therefore be carried out by inspecting the attributes of devices within each attribute level cluster and identifying the attribute level cluster whose devices have attributes that are most similar to the device attributes specified in the characteristic set (302) for the device (300).
Consider an example in which the device is a mobile phone whose characteristic set (302) includes attributes named 'mobileCarrier,' 'areaCode,' 'gpsEnabled,' and 'telecommunicationsWirelessStandard.' Furthermore, assume that a first attribute level cluster includes devices that include attributes named 'mobileCarrier,' 'areaCode,' 'telecommunicationsWirelessStandard,' 'manufacturer,' and 'model.' Additionally, assume that a second attribute level cluster includes devices that include attributes named 'displayType,' 'screenSize,' 'numberOfHDMIPorts,' and 'dolbySoundVersion.' In such an example, the first attribute level cluster appears to be a cluster of mobile phones while the second attribute level cluster appears to be a cluster of televisions.
Clustering the device (300) into an attribute level cluster based on the one or more device attributes specified in the characteristic set (302) for the device (300) may be carried out by inspecting the attributes of devices within each attribute level cluster and identifying the attribute level cluster whose devices have attributes that are most similar to the device attributes specified in the characteristic set (302) for the device (300). In the example described above, the mobile phone whose characteristic set (302) includes attributes named 'mobileCarrier,' 'areaCode,' 'gpsEnabled,' and 'telecommunicationsWirelessStandard' has three attributes in common with the devices in the first attribute level cluster and zero attributes in common with the devices in the second attribute level cluster. As such, the device (300) would be clustered into the first attribute level cluster as the device has attributes that are more similar to the devices in the first attribute level cluster than the devices in the second attribute level cluster.
The device clustering module (304) of FIG. 1 is also configured to cluster the device (300) into a value level cluster based on the attribute value for each device attribute. In the example method of FIG. 1, the value level cluster is a cluster of devices that have similar attributes and also have similar values for identical attributes. In the example of FIG. 1, the value level cluster is a subset of the attribute level cluster.
Consider the example described above in which the device (300) is a mobile phone whose characteristic set (302) includes attributes named 'mobileCarrier,' 'areaCode,' 'gpsEnabled,' and 'telecommunicationsWirelessStandard.' In such an example, assume that the characteristic set (302) includes the following values for each attribute: mobileCarrier=Verizon, areaCode=512, gpsEnabled=yes, and telecommunicationsWirelessStandard=4G. Furthermore, assume that the attribute values of devices in a first value level cluster are: mobileCarrier=Sprint, areaCode=214, gpsEnabled=no, and telecommunicationsWirelessStandard=3G. Additionally, assume that the attribute values of devices in a first value level cluster are: mobileCarrier=Verizon, areaCode=214, gpsEnabled=yes, and telecommunicationsWirelessStandard=4G. In such an example, the values associated with each attribute of the device (300) match zero of the values of devices in the first value level cluster and the values associated with each attribute of the device (300) match three of the values of devices in the second value level cluster. In such an example, the device (300) can be clustered in the second value level cluster as the attribute values for the device (300) are more similar to devices in the second value level cluster than devices in the first value level cluster.
Also stored in RAM (168) is an operating system (154). Operating systems useful clustering devices in an IoT according to embodiments of the present invention include UNIX™, Linux™, Microsoft XP™, AIX™, IBM's i5/OS™, and others as will occur to those of skill in the art. The operating system (154) and device clustering module (304) in the example of FIG. 1 are shown in RAM (168), but many components of such software typically are stored in non-volatile memory also, such as, for example, on a disk drive (170).
The computer (152) of FIG. 1 includes disk drive adapter (172) coupled through expansion bus (160) and bus adapter (158) to processor (156) and other components of the computer (152). Disk drive adapter (172) connects non-volatile data storage to the computer (152) in the form of disk drive (170). Disk drive adapters useful in computers for clustering devices in an IoT according to embodiments of the present invention include Integrated Drive Electronics ('IDE') adapters, Small Computer System Interface ('SCSI') adapters, and others as will occur to those of skill in the art. Non-volatile computer memory also may be implemented for as an optical disk drive, electrically erasable programmable read-only memory (so-called 'EEPROM' or 'Flash' memory), RAM drives, and so on, as will occur to those of skill in the art.
The example computer (152) of FIG. 1 includes one or more input/output ('I/O') adapters (178). I/O adapters implement user-oriented input/output through, for example, software drivers and computer hardware for controlling output to display devices such as computer display screens, as well as user input from user input devices (181) such as keyboards and mice. The example computer (152) of FIG. 1 includes a video adapter (209), which is an example of an I/O adapter specially designed for graphic output to a display device (180) such as a display screen or computer monitor. Video adapter (209) is connected to processor (156) through a high speed video bus (164), bus adapter (158), and the front side bus (162), which is also a high speed bus.
The example computer (152) of FIG. 1 includes a communications adapter (167) for data communications with other computers (182) and for data communications with a data communications network (100). Such data communications may be carried out serially through RS-232 connections, through external buses such as a Universal Serial Bus ('USB'), through data communications networks such as IP data communications networks, and in other ways as will occur to those of skill in the art. Communications adapters implement the hardware level of data communications through which one computer sends data communications to another computer, directly or through a data communications network. Examples of communications adapters useful for clustering devices in an IoT according to embodiments of the present invention include modems for wired dial-up communications, Ethernet (IEEE 802.3) adapters for wired data communications network communications, and 802.11 adapters for wireless data communications network communications.
For further explanation, FIG. 2 sets forth a diagram of attribute level clusters (200, 208) and value level clusters (202, 204,206, 210, 212, 214, 216) according to embodiments of the present invention. In the example of FIG. 2, each cluster (200,202, 204, 206, 208, 210, 212, 214, 216) is a logical grouping of devices. Each attribute level cluster (200, 208) represents a logical grouping of devices based on the device attributes of each device that is included in the attribute level cluster (200,208). In the example of FIG. 2, however, the value associated with device attributes are not a factor in determining which attribute level cluster (200, 208) a device should belong to. That is, devices are clustered into the same attribute level cluster (200, 208) on the basis of having similar attributes as other devices within the attribute level cluster (200, 208)—even if the values of the attributes are different.
In the example of FIG. 2, each value level cluster (202, 204, 206, 210, 212, 214, 216) represents a logical grouping of devices that is not only based on the device attributes of each device that is included in the value level cluster (202, 204,206, 210, 212, 214, 216) but is also based on the value of each attribute for devices that are included in the value level cluster (202, 204, 206, 210, 212, 214, 216). That is, devices are clustered into the same value level cluster (202, 204, 206,210, 212, 214, 216) on the basis of having similar attributes and similar attribute values as other devices within the value level cluster (202, 204, 206, 210, 212, 214, 216).
For further explanation, FIG. 3 sets forth a flow chart illustrating an example method for clustering devices in an IoT according to embodiments of the present invention. The example method of FIG. 3 includes receiving (306), by a device clustering module (304), a characteristic set (302) for a device (300). In the example method of FIG. 3, the device clustering module (304) is a module of computer program instructions that, when executed, is configured to cluster Internet connected devices. The device clustering module (304) 'clusters' Internet connected devices in the sense that the device clustering module (304) associates similar Internet connected devices with a particular cluster group. The device clustering module (304) can cluster Internet connected devices, for example, through the use of a database, table, or other data structure that includes an identifier of a particular device and an identifier of a particular cluster that the device is part of Clustering similar Internet connected devices within a particular cluster group can enable a larger system to better manage a collection of heterogeneous devices by, for example, imposing some usage rules and policies on the set of similar devices that are included in a cluster group, provide access control and security restrictions to the set of similar devices that are included in a cluster group, perform device configuration operations on the set of similar devices that are included in a cluster group, and so on.
In the example method of FIG. 3, the characteristic set (302) specifies one or more device attributes and an attribute value for each device attribute. In the example method of FIG. 3, devices may be deemed as being similar and therefore subject to clustering in dependence on attributes of the device such as, for example, the type of network connection supported by the device, the communications protocols supported by the device, the manufacturer of the device, different capabilities of the device, and so on. In the example method of FIG. 3, such device attributes are specified in a characteristic set (302) that specifies one or more device attributes and an attribute value for each device attribute. The characteristic set (302) ofFIG. 3 may be embodied, for example, as a structured document such as an extensible markup language ('XML') document that includes an identifier of a particular attribute of the device (300) as well as a value for the attribute. Consider the following example of a characteristic set (302) embodied as an XML document:
<ns:deviceData xmlns:ns="urn:foo"> |
||
<ns: attribute id = "1"> |
||
<ns:name>deviceType</ns:name> |
||
<ns:value>MobilePhone</ns:value> |
||
</ns:attribute> |
||
<ns: attribute id = "2"> |
||
<ns:name>manufacturer</ns:name> |
||
<ns:value>Nokia</ns:value> |
||
</ns:attribute> |
||
<ns: attribute id = "3"> |
||
<ns:name>model</ns:name> |
||
<ns:value>N72</ns:value> |
||
</ns:attribute> |
||
</ns:deviceData> |
||
In the example characteristic set (302) above, the characteristic set includes three attributes and values for each attribute that describe the device that the characteristic set (302) represents. The first attribute is a 'deviceType' attribute with a value set to 'MobilePhone' which indicates that the device is a mobile phone. The second attribute is a 'manufacturer' attribute with a value set to 'Nokia' which indicates that the mobile phone is manufactured by Nokia™. The third attribute is a 'model' attribute with a value set to 'N72' which indicates the manufacturer's mobile number for the mobile phone. Readers will appreciate that the example characteristic set (302) set forth above can include a number of additional attributes and values associated with such attributes that can be used to describe additional information about the device that is associated with the characteristic set.
The example method of FIG. 3 also includes clustering (308), by the device clustering module (304), the device (300) into an attribute level cluster based on the one or more device attributes specified in the characteristic set (302) for the device (300). In the example method of FIG. 3, the attribute level cluster is a cluster of devices that have similar attributes—with no consideration given to the values associated with each attribute. Clustering (308) the device (300) into an attribute level cluster based on the one or more device attributes specified in the characteristic set (302) for the device (300) may therefore be carried out by inspecting the attributes of devices within each attribute level cluster and identifying the attribute level cluster whose devices have attributes that are most similar to the device attributes specified in the characteristic set (302) for the device (300).
Consider an example in which the device is a mobile phone whose characteristic set (302) includes attributes named 'mobileCarrier,' 'areaCode,' 'gpsEnabled,' and 'telecommunicationsWirelessStandard.' Furthermore, assume that a first attribute level cluster includes devices that include attributes named 'mobileCarrier,' 'areaCode,' 'telecommunicationsWirelessStandard,' 'manufacturer,' and 'model.' Additionally, assume that a second attribute level cluster includes devices that include attributes named 'displayType,' 'screenSize,' 'numberOfHDMIPorts,' and 'dolbySoundVersion.' In such an example, the first attribute level cluster appears to be a cluster of mobile phones while the second attribute level cluster appears to be a cluster of televisions.
Clustering (308) the device (300) into an attribute level cluster based on the one or more device attributes specified in the characteristic set (302) for the device (300) may be carried out by inspecting the attributes of devices within each attribute level cluster and identifying the attribute level cluster whose devices have attributes that are most similar to the device attributes specified in the characteristic set (302) for the device (300). In the example described above, the mobile phone whose characteristic set (302) includes attributes named 'mobileCarrier,' 'areaCode,' 'gpsEnabled,' and 'telecommunicationsWirelessStandard' has three attributes in common with the devices in the first attribute level cluster and zero attributes in common with the devices in the second attribute level cluster. As such, the device (300) would be clustered into the first attribute level cluster as the device has attributes that are more similar to the devices in the first attribute level cluster than the devices in the second attribute level cluster.
The example method of FIG. 3 also includes clustering (310), by the device clustering module (304), the device (300) into a value level cluster based on the attribute value for each device attribute. In the example method of FIG. 3, the value level cluster is a cluster of devices that have similar attributes and also have similar values for identical attributes. In the example of FIG. 3, the value level cluster is a subset of the attribute level cluster.
Consider the example described above in which the device (300) is a mobile phone whose characteristic set (302) includes attributes named 'mobileCarrier,' 'areaCode,' 'gpsEnabled,' and 'telecommunicationsWirelessStandard.' In such an example, assume that the characteristic set (302) includes the following values for each attribute: mobileCarrier=Verizon, areaCode=512, gpsEnabled=yes, and telecommunicationsWirelessStandard=4G. Furthermore, assume that the attribute values of devices in a first value level cluster are: mobileCarrier=Sprint, areaCode=214, gpsEnabled=no, and telecommunicationsWirelessStandard=3G. Additionally, assume that the attribute values of devices in a first value level cluster are: mobileCarrier=Verizon, areaCode=214, gpsEnabled=yes, and telecommunicationsWirelessStandard=4G. In such an example, the values associated with each attribute of the device (300) match zero of the values of devices in the first value level cluster and the values associated with each attribute of the device (300) match three of the values of devices in the second value level cluster. In such an example, the device (300) can be clustered in the second value level cluster as the attribute values for the device (300) are more similar to devices in the second value level cluster than devices in the first value level cluster.
For further explanation, FIG. 4 sets forth a flow chart illustrating an example method for clustering devices in an IoT according to embodiments of the present invention. The example method of FIG. 4 is similar to the example method of FIG. 3 as it also includes receiving (306) a characteristic set (302) for a device (300), clustering (308) the device (300) into an attribute level cluster, and clustering (310) the device (300) into a value level cluster.
In the example method of FIG. 4, clustering (308) the device (300) into an attribute level cluster can include calculating (402), by the device clustering module (304), a commonality index between the device (300) and a cluster representative for each attribute level cluster. In the example of FIG. 4, the cluster representative specifies a union of all attributes of each device in the attribute level cluster. For example, if a first device in the attribute level cluster includes attributes A, B, and C, while a second device in the attribute level cluster includes attributes B, C, and D, the cluster representative for the attribute level cluster would include attributes A, B, C, and D.
In the example method of FIG. 4, calculating (402), by the device clustering module (304), a commonality index between the device (300) and a cluster representative for each attribute level cluster can be carried out, for example, by determining the number of attributes that the device (300) has in common with the cluster representative for an attribute level cluster, by determining the number of attributes that the device (300) has that are not in common with the cluster representative for an attribute level cluster, by determining the percentage of attributes that the device (300) has in common with the cluster representative for an attribute level cluster, and so on.
In the example method of FIG. 4, calculating (402) a commonality index between the device (300) and a cluster representative for each attribute level cluster can include determining (404), by the device clustering module (304), the number of device attributes specified in the characteristic set (302) for the device (300) that are also attributes of the cluster representative. Consider an example in which the device (300) has attributes A, B, and C. Furthermore, assume that a cluster representative for a first attribute level cluster has attributes, A, D, E, and F, while a cluster representative for a second attribute level cluster has attributes B, C, and D. In such an example, the device (300) has one attribute in common with the first attribute level cluster and two attributes in common with the second attribute level cluster.
In the example method of FIG. 4, calculating (402) a commonality index between the device (300) and a cluster representative for each attribute level cluster can also include determining (406), by the device clustering module (304), the number of device attributes specified in the characteristic set (302) for the device (300) that are not attributes of the cluster representative. Consider the example described above in which the device (300) has attributes A, B, and C, a cluster representative for a first attribute level cluster has attributes, A, D, E, and F, and a cluster representative for a second attribute level cluster has attributes B, C, and D. In such an example, the device (300) has two attributes that are not attributes in the first attribute level cluster and the device (300) has one attribute that is not an attribute of second attribute level cluster.
In the example method of FIG. 4, calculating (402) a commonality index between the device (300) and a cluster representative for each attribute level cluster can also include dividing (408), by the device clustering module (304), the number of device attributes specified in the characteristic set (302) for the device (300) that are also attributes of the cluster representative by the sum of the number of device attributes specified in the characteristic set (302) for the device (300) that are also attributes of the cluster representative and the number of device attributes specified in the characteristic set (302) for the device (300) that are not attributes of the cluster representative. Consider the example described above in which the device (300) has one attribute (A) in common with a first attribute level cluster and two attributes (B and C) that are not contained by any device in the first attribute level cluster. In such an example, the commonality index between the device (300) and the first attribute level cluster is 1/(2+1)=1/3. In the same example, the device (300) has two attributes (B and C) in common with a second attribute level cluster and one attribute (A) that are is contained by any device in the second attribute level cluster. In such an example, the commonality index between the device (300) and the second attribute level cluster is 2/(1+2)=2/3.
Although the example described above illustrates an embodiment in which all attributes are weighted evenly, readers will appreciate that in alternative embodiments attributes may not be weighted equally. In the example described above, in alternative embodiments it may be deemed that attribute A is the most critical attribute of the device (300) while attributes B and C are less important. In such an example, calculating (402) a commonality index between the device (300) and a cluster representative for each attribute level cluster may take into account the relative importance of each attribute such that the commonality index is generated using a formula in which greater importance is placed on finding an attribute level cluster whose devices include attribute A while less importance if placed on finding an attribute level cluster whose devices include attributes B and C.
In the example method of FIG. 4, clustering (308) the device (300) into an attribute level cluster can also include assigning (410), by the device clustering module (304), the device (300) to the attribute level cluster with the highest commonality index. Continuing with the example described above, the device (300) would be assigned to the second attribute level cluster by virtue of the device (300) having a commonality index of 2/3 with the second attribute level cluster while the device (300) only has a commonality index of 1/3 with the first attribute level cluster. In the example of FIG. 4, assigning (410) the device (300) to the attribute level cluster with the highest commonality index may be carried out, for example, by updating an attribute level cluster table or other data structure that associates an attribute level cluster with the devices that are included in the attribute level cluster. Table 1 illustrates an example of an attribute level cluster table:
E, F |
||||||||||||||
In the example illustrated in Table 1, two attribute level clusters are identified: an attribute level cluster with a cluster identifier of '1' and an attribute level cluster with a cluster identifier of '2.' The attribute level cluster with a cluster identifier of '1' includes three devices: a device with a device identifier of '1,' a device with a device identifier of '2,' and a device with a device identifier of '3.' The attribute level cluster with a cluster identifier of '2' includes four devices: a device with a device identifier of '4,' a device with a device identifier of '5,' a device with a device identifier of '6,' and a device with a device identifier of '7.' In addition, the attribute level cluster table above also identifies the attributes for each device specified in Table 1.
For further explanation, FIG. 5 sets forth a flow chart illustrating an example method for clustering devices in an IoT according to embodiments of the present invention. The example method of FIG. 5 is similar to the example method of FIG. 3 as it also includes receiving (306) a characteristic set (302) for a device (300), clustering (308) the device (300) into an attribute level cluster, and clustering (310) the device (300) into a value level cluster. In the example method of FIG. 5, clustering (310) the device (300) into a value level cluster can include calculating (502) for each value level cluster, by the device clustering module (304), a similarity index between the device (300) and all devices in the value level cluster. Calculating (502), for each value level cluster, a similarity index between the device (300) and all devices in the value level cluster may be carried out by calculating a similarity index between the device (300) and each device in every value level cluster. After calculating a similarity index between the device (300) and each device in every value level cluster, the device clustering module (304) may identify the devices that are in a particular value level cluster, sum up the similarity index between the device (300) and each device in the particular value level cluster, and divide that sum by the number of devices in the particular value level cluster to identify an average similarity index between the device (300) and the devices in the particular value level cluster.
Consider an example in which the similarity index between the device (300) and devices in a first value level cluster are 0.1, 0.2, and 0.3. In such an example, the similarity index between the device (300) and the devices in the first value level cluster is (0.1+0.2+0.3)/3=0.2. In the same example, assume that the similarity index between the device (300) and devices in a second value level cluster are 0.1, 0.2, 0.3, and 0.3. In such an example, the similarity index between the device (300) and the devices in the second value level cluster is (0.1+0.2+0.3+0.3)/4=0.225.
In the example method of FIG. 5, clustering (310) the device (300) into a value level cluster can also include assigning (504), by the device clustering module (304), the device (300) to the value level cluster with the highest average similarity index. Consider the example describe above in which the similarity index between the device (300) and the devices in the first value level cluster is 0.2 and the similarity index between the device (300) and the devices in the second value level cluster is 0.225. In such an example, the device (300) would be assigned (504) to the second value level cluster because the similarity index between the device (300) and devices in the second value level cluster is higher that the similarity index between the device (300) and devices in the first value level cluster.
In the example of FIG. 5, assigning (504) the device (300) to the value level cluster with the highest average similarity index may be carried out, for example, by updating an value level cluster table or other data structure that associates a value level cluster with the devices that are included in the value level cluster. Table 2 illustrates an example of a value level cluster table:
In the example illustrated in Table 2, five value level clusters are identified: a value level cluster with a cluster identifier of '1,' a value level cluster with a cluster identifier of '2,' a value level cluster with a cluster identifier of '3,' a value level cluster with a cluster identifier of '4,' and a value level cluster with a cluster identifier of '5.' The value level cluster with a cluster identifier of '1' includes one device identified by a device identifier of '1' and is part of an attribute level cluster identified by an attribute level cluster identifier of '1.' The value level cluster with a cluster identifier of '2' includes one device identified by a device identifier of '2,' a second device identified by a device identifier of '3,' and is also part of an attribute level cluster identified by an attribute level cluster identifier of '1.' The value level cluster with a cluster identifier of '3' includes one device identified by a device identifier of '4' and is also part of an attribute level cluster identified by an attribute level cluster identifier of '1.' The value level cluster with a cluster identifier of '4' includes one device identified by a device identifier of '5,' a second device identified by a device identifier of '6,' and is part of an attribute level cluster identified by an attribute level cluster identifier of '2.' The value level cluster with a cluster identifier of '5' includes one device identified by a device identifier of '7' and is also part of an attribute level cluster identified by an attribute level cluster identifier of '2.'
For further explanation, FIG. 6 sets forth a flow chart illustrating an example method for clustering devices in an IoT according to embodiments of the present invention. The example method of FIG. 6 is similar to the example method of FIG. 5 as it also includes receiving (306) a characteristic set (302) for a device (300), clustering (308) the device (300) into an attribute level cluster, clustering (310) the device (300) into a value level cluster, including calculating (502) for each value level cluster a similarity index between the device and all devices in the value level cluster and assigning (504) the device to the value level cluster with the highest average similarity index.
In the example method of FIG. 6, calculating (502) for each value level cluster a similarity index between the device and all devices in the value level cluster also includes, for each value level cluster, calculating (602) the similarity index between the device (300) and each of the devices in the value level cluster. In the example of FIG. 6, calculating (602) the similarity index between the device (300) and each of the devices in the value level cluster is carried out by the device clustering module (304). Calculating (602) the similarity index between the device (300) and each of the devices in the value level cluster can carried out, for example, by comparing the values for attributes of the device (300) to values for the same attributes of devices with the value level cluster.
Consider an example in which the device (300) includes attributes A, B, and C, and a device in the value level cluster included attributes A, B, and D. In such an example, the value associated with attribute A of the device (300) can be compared to the value associated with attribute A of the device in the value level cluster. Likewise, the value associated with attribute B of the device (300) can be compared to the value associated with attribute B of the device in the value level cluster. If the values are identical, or match within a predefined threshold, the similarity index between the device (300) and the device in the value level cluster is increased. In the example method of FIG. 6, the similarity index can be increased according to a weighting scheme in which certain attributes are weighted more heavily than others.
In the example method of FIG. 6, calculating (502) for each value level cluster a similarity index between the device and all devices in the value level cluster also includes, for each value level cluster, summing (604) the similarity index between the device (300) and each of the devices in the value level cluster, thereby producing a similarity index sum. In the example ofFIG. 6, summing (604) the similarity index between the device (300) and each of the devices in the value level cluster is carried out by the device clustering module (304). Consider an example in which the similarity index between the device (300) and devices in a first value level cluster are 0.1, 0.2, and 0.3. In such an example, the similarity index sum between the device (300) and devices in the first value level cluster is 0.1+0.2+0.3=0.6. In the same example, assume that the similarity index between the device (300) and devices in a second value level cluster are 0.1, 0.2, 0.3, and 0.3. In such an example, the similarity index sum is 0.1+0.2+0.3+0.3=0.9.
In the example method of FIG. 6, calculating (502) for each value level cluster a similarity index between the device and all devices in the value level cluster also includes, for each value level cluster, determining (606) the number of devices in the value level cluster. In the example method of FIG. 6, determining (606) the number of devices in the value level cluster is carried out by the device clustering module (304). The device clustering module (304) may determine the number of devices in the value level cluster, for example, by scanning a value level cluster table as illustrated above with reference to Table 2.
In the example method of FIG. 6, calculating (502) for each value level cluster a similarity index between the device and all devices in the value level cluster also includes, for each value level cluster, dividing (608) the similarity index sum by the number of devices in the value level cluster. Consider the example described above in which the similarity index sum between the device (300) and devices in the first value level cluster is 0.6 and the similarity index between the device (300) and devices in a second value level cluster is 0.9. The first value level cluster includes 3 devices such that dividing (608) the similarity index sum by the number of devices in the first value level cluster produces an average similarity index of 0.6/3=0.2. The second value level cluster includes 4 devices such that dividing (608) the similarity index sum by the number of devices in the first value level cluster produces an average similarity index of 0.9/4=0.225.
The weights of an attribute can be a function of its importance, entropy, dispersion, and so on. Let w={w1, w2, w3, . . . , wn} be the set of weights of attributes where wi is the weight of attribute ai. In such an example, the weight are normalized such that such that the sum of all weights in the set of weights is equal to 1. Between two devices, Di and Dj, the common set of attribute-value pairs between the two devices is characterized as the intersection of the attribute-value set for Di and the attribute-value set for Dj. Furthermore, the common set of attributes between the two devices is characterized as the intersection of the attribute set for Di and the attribute set for Dj. In such an example, if attribute-value pair exists in each device, the attribute-value pair is part of the common set of attribute-value pairs denoted as Zij for devices Di and Dj. If, however, an attribute is shared by each device but the value associated with the similar attributes does not match, the attribute is part of the common set of attributes denoted as Z′ij for devices Di and Dj. In such an example, the similarity index between the two devices Di and Dj can be calculated as (α+β) times the weighted sum of the attributes that are in the common set of attribute-value pairs denoted as Zij plus α times the weighted sum of the attributes that are in the common set of attributes denoted as Z′ij. In such an example, α+β are two parameters that control the importance given to a match of an attribute and a match of its corresponding value. Thinking of this example as instituting a points system, if there is only an attribute match then only α points are awarded whereas if the there is both a match of an attribute and its corresponding value, then α+β points are awarded under the precondition that α+β=1.
SRC=https://www.google.com.hk/patents/US20130173621
Clustering Devices In An Internet Of Things的更多相关文章
- About SQLite
About SQLite See Also... Features When to use SQLite Frequently Asked Questions Well-known Users Boo ...
- Overview and Evaluation of Bluetooth Low Energy: An Emerging Low-Power Wireless Technology
转自:http://www.mdpi.com/1424-8220/12/9/11734/htm Sensors 2012, 12(9), 11734-11753; doi:10.3390/s12091 ...
- Checklist For Choosing The Right Database Engine
http://sqlite.org/whentouse.html Appropriate Uses For SQLite SQLite is not directly comparable to cl ...
- Security Checklist (路由器安全checklist)
Security Checklist Website by Michael Horowitz Home | Introduction | Router Bugs | Security Che ...
- ios7 sdk 新特性
iOS 7 is a major update with compelling features for developers to incorporate into their apps. The ...
- What is SaaS?
SaaS, or Software as a Service, describes any cloud service where consumers are able to access softw ...
- IOT数据库选型——NOSQL,MemSQL,cassandra,Riak或者OpenTSDB,InfluxDB
IoT databases should be as flexible as required by the application. NoSQLdatabases -- especially key ...
- (Java学习笔记) Java Networking (Java 网络)
Java Networking (Java 网络) 1. 网络通信协议 Network Communication Protocols Network Protocol is a set of rul ...
- darktrace 亮点是使用的无监督学习(贝叶斯网络、聚类、递归贝叶斯估计)发现未知威胁——使用无人监督 机器学习反而允许系统发现罕见的和以前看不见的威胁,这些威胁本身并不依赖 不完善的训练数据集。 学习正常数据,发现异常!
先说说他们的产品:企业免疫系统(基于异常发现来识别威胁) 可以看到是面向企业内部安全的! 优点整个网络拓扑的三维可视化企业威胁级别的实时全局概述智能地聚类异常泛频谱观测 - 高阶网络拓扑;特定群集,子 ...
随机推荐
- HttpClient请求发送的几种用法二:
public class HttpClientHelper { private static readonly HttpClientHelper _instance = new H ...
- 7 Java Performance Metrics to Watch After a Major Release--转
原文地址:https://dzone.com/articles/7-java-performance-metrics-to-watch-after-a-major-1 The Java perform ...
- python中如何在一张图上画两条折线
摘自:https://segmentfault.com/q/1010000002760775
- ZOJ 2405 Specialized Four-Digit Numbers
http://acm.zju.edu.cn/onlinejudge/showProblem.do?problemId=1405 要求找出4位数所有10进制.12进制.16进制他们各位数字之和相等. # ...
- Vim技巧之四大模式_插入模式
Vim技巧之四大模式_插入模式 在插入模式中及时更正错误 插入-普通模式 在插入模式模式以下直接粘贴指定寄存器的内容 插入模式中做运算 用字符编码插入很常使用字符 替换已有的文本 Vim技巧之四大模式 ...
- hadoop集群中的日志文件 分类: A1_HADOOP 2015-02-28 20:37 680人阅读 评论(0) 收藏
hadoop存在多种日志文件,其中master上的日志文件记录全面信息,包括slave上的jobtracker与datanode也会将错误信息写到master中.而slave中的日志主要记录完成的ta ...
- Need ffmpeg exe. You can download it by calling: imageio.plugins.ffmpeg.download()
该问题 Need ffmpeg exe. You can download it by calling: imageio.plugins.ffmpeg.download()往往出现在在调用 impor ...
- java开发中序列化与反序列化起到的作用
基本概念: 序列化是将对象状态转换为可保持或传输的格式的过程.与序列化相对的是反序列化,它将流转换为对象. 这两个过程结合起来,能够轻松地存储和数据传输. 特别在网络传输中,它的作用显得尤为重要.我们 ...
- Hadoop配置文件 分类: A1_HADOOP 2014-08-19 12:48 1157人阅读 评论(1) 收藏
部分内容参考:http://www.linuxqq.net/archives/964.html http://slaytanic.blog.51cto.com/2057708/1100974/ ha ...
- Tomcat结合Apache、Nginx实现高性能的web服务器
一.Tomcat为什么需要与apache.nginx一起结合使用? Tomcat虽然是一个servlet和jsp容器,但是它也是一个轻量级的web服务器.它既可以处理动态内容,也可以处理静态内容.不过 ...