Weapon Detection

Weapon detection is the need of today. It plays a crucial role in many applications, such as hostage scenes, surveillance of sensitive areas, anti-terrorist operations, etc. To make the weapon detection model more efficient, we introduce a weapon dataset, named IITP-W, that captures the following properties: a) images depicting real-world scenarios, including complex backgrounds, diverse lighting conditions, object occlusions, and varying image resolutions, b) images having large and small weapons, c) absence of images sharing identical information, and d) exclusion of synthetic images. The dataset includes three types of weapons: (1) Short gun (2) long guns and (3) knife. The IITP-W dataset consists of 4292 instances of short guns, 1047 instances of knifes and 5447 instances of long guns with complex backgrounds, varied sizes, different lightening conditions and different resolutions. The short gun category includes images of real guns belonging to 30 different types in different firing statuses. Similarly, long gun category includes images of real gun belonging 61 different types. Figure 2 depicts images from existing and proposed datasets, highlighting the differences. Additionally, Table 1 furnishes details such as data size, image count with plain backgrounds and synthetic images for both existing and proposed datasets. The dataset is available for download at: www.iitp.ac.in/~halder/Prj/Dataset/IITP-W.rar

Dataset Collection

The IITP-W dataset is assembled by two means. Firstly, by taking images of weapons by simulating several crime scenes at various locations. A group of people willingly took part as volunteers to play the roles of victims and criminals. These volunteers were asked to hold mock weapons. The photos taken from varied angles consist of 84% low resolution (in the range of 0.01 to 1 megapixels) and 4% high-resolution images (in the range of 5 or more megapixels), as shown in Figure 3.


Secondly, we collect images from a number of movies containing crime scenes and CCTV clips of real criminal activities from various online resources. Initially, the parts of movies containing weapons in various crime scenes are extracted using the Filmora Tool: https://filmora.wondershare.com. Further, we extract relevant images from these videos using a Python script. Through this process, we generate a total of 9760 images. Figure 4 shows the count of images in our dataset based on types of weapons and their counts. 

Figure 3: Count of images of different resolutions. The x-axis shows a range of image resolutions, and each bar shows the count of images for a particular range of image resolutions. The graph shows that 84% of the data has an image resolution between 0.001 to 1-megapixel.

Figure 4: The Figure shows the categorization of images based on the number of weapons present in each image.Â