Implementation of Spatio-Temporal Background Subtraction Model

DOI : 10.17577/IJERTCONV3IS19170


Waribam Jotin Singh

IV Sem, M.Tech (DCN)

John Institute of Technology, Bangalore, India

Ms. Anni Susan Basu

Assistant Professor, Dept. of ECE

John Institute of Technology, Bangalore, India

Abstract – Background subtraction remains an open problem in complex situations such as dynamic backgrounds, illumination variations, and blurred foreground objects. To overcome these challenges, we propose an effective background subtraction method based on spatio-temporal representations. In the experiments, we validate the proposed method in several challenging scenarios and show superior performance over other state-of-the-art background subtraction approaches. There are several challenges in developing a good background subtraction algorithm. First, it must be robust against changes in illumination. Second, it should ignore the motion of background elements such as swinging leaves, rain, snow, and shadows cast by moving objects. Finally, its internal background model should react quickly to changes in the background, such as the starting and stopping of vehicles. In this project, we compare various background subtraction algorithms for detecting moving vehicles and pedestrians in urban traffic video sequences, considering approaches ranging from simple techniques such as frame differencing and adaptive median filtering to more sophisticated probabilistic modelling techniques.

      Keywords – Spatio-temporal representation, video bricks, background subtraction.

      1. INTRODUCTION

Background subtraction, or foreground extraction, has been studied extensively for years, yet it remains an open challenge because of the demands of real-time applications. Background subtraction is a common method for detecting moving objects and has been widely used in many surveillance systems. Moving object detection by means of background subtraction compares an input image with a previously prepared background model and picks up the regions of the input image that do not match the background model. Moving object detection with background subtraction has the advantage of not requiring prior knowledge of the moving objects. On the other hand, background subtraction has the problem that it cannot discriminate moving objects from the background when the background changes significantly.

Fig. 1: Flow diagram for a background subtraction algorithm

Video surveillance systems aim to automatically identify events of interest in a variety of circumstances. Example applications include intrusion detection, activity monitoring, and pedestrian counting. The capability of extracting moving objects from a video sequence is a fundamental and crucial problem for these vision systems. For systems using static cameras, background subtraction is the method typically used to segment moving regions in the image sequences, by comparing each new frame to a model of the scene background.

In our proposed method, we learn and maintain the dynamic models within spatio-temporal video patches (i.e. video bricks), accounting for the real challenges of surveillance scenarios. The algorithm can process 15-20 frames per second at a resolution of 352 × 288 pixels on average. We briefly overview the proposed framework of background modelling in the following aspects:

1. Spatio-Temporal Representations: We represent the observed scene by video bricks, i.e. video volumes spanning both the spatial and temporal domains, in order to model spatial and temporal information jointly. Specifically, for every location in the scene, a sequence of video bricks is extracted, from which we learn and update the background models (a brick-extraction sketch is given after this list). Moreover, to encode the video bricks compactly and robustly against illumination variations, we design a brick-based descriptor, called the Center-Symmetric Spatio-Temporal Local Ternary Pattern (CS-STLTP), which is inspired by the 2D scale-invariant local pattern operator. Its effectiveness is also validated in the experiments.

2. Pursuing Dynamic Subspaces: We treat the sequence of video bricks at a certain location as a consecutive signal and generate the subspace spanned by these video bricks. A linear dynamic system (the Auto-Regressive Moving Average, ARMA, model) is adopted to characterize the spatio-temporal statistics of the subspace. Specifically, the observed video bricks are expressed as a data matrix in which each column contains the feature vector of one video brick. The basis vectors (i.e. eigenvectors) of this matrix can then be estimated analytically, representing the appearance parameters of the subspace, and the parameters of the dynamical variations are further computed with the appearance parameters fixed. It is worth mentioning that our background model jointly captures appearance and motion information, as the data (i.e. the features of the video bricks) are extracted over both the spatial and temporal domains.

3. Maintaining Dynamic Subspaces Online: Given newly appearing video bricks, moving foreground objects are segmented by estimating the residuals within the related subspaces of the scene, while the background models are maintained simultaneously to account for scene changes. The arising problem is how to update the parameters of the subspaces incrementally despite the disturbance of foreground objects and background noise. A new observation may include noise pixels (i.e. outliers), which degrade the model updating. Furthermore, in our representation a video brick can be partially occluded by foreground objects, i.e. only some of the pixels in the brick truly belong to the background. To overcome this problem, we present an approach that compensates the observations (i.e. the observed video bricks) by generating data from the current models. Specifically, we replace the pixels labelled as non-background by generated pixels to synthesize the new observations. The algorithm for online model updating includes two steps: (i) update the appearance parameters using an incremental subspace learning technique, and (ii) update the dynamical variation parameters by analytically solving the linear reconstruction. The experiments show that the proposed method effectively improves robustness during online processing.
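As referenced in item 1, the scene is decomposed into small video bricks. Below is a minimal NumPy sketch of that decomposition for a grayscale frame stack; the function name, the non-overlapping tiling, and the default 4 × 4 × 5 brick size are illustrative assumptions rather than the paper's exact implementation.

```python
import numpy as np

def extract_bricks(frames, w=4, h=4, t=5):
    """Split a temporal slice of t frames into non-overlapping w x h x t bricks.

    frames : array of shape (t, H, W), one grayscale frame per row.
    Returns an array of shape (H // h, W // w, t, h, w): one brick per
    spatial location of the scene.
    """
    t_, H, W = frames.shape
    assert t_ == t, "expected exactly t frames in this temporal slice"
    rows, cols = H // h, W // w
    bricks = np.empty((rows, cols, t, h, w), dtype=frames.dtype)
    for r in range(rows):
        for c in range(cols):
            bricks[r, c] = frames[:, r * h:(r + 1) * h, c * w:(c + 1) * w]
    return bricks
```

Sliding the temporal slice forward through the video yields, at each location, the brick sequence V = {v1, v2, …} used by the background model.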

      2. DYNAMIC SPATIO-TEMPORAL MODEL

In this section, we introduce the background model, and then discuss the video brick representation and our model definition, respectively.

          1. Background

In general, a complex surveillance background may include diverse appearances that move and change randomly and dynamically over time. There is a body of work on time-varying texture modelling in computer vision. These methods often treat the scene as a whole and pursue a global subspace by utilizing a linear dynamic system (LDS). Such models work well on some natural scenes, such as homogeneous textures, since the LDS characterizes the subspace with a set of linearly combined components. However, under real surveillance challenges, pursuing a global subspace can be intractable. Here, we represent the observed scene by an array of small and independent subspaces, each of which is defined by a linear system, so as to better handle challenging scene variations. Our background model can thus be viewed as a mixed compositional model consisting of linear subspaces. In particular, we conduct background subtraction based on the following observations.

Assumption 1: The local scene variations (i.e. appearance and motion changing over time) can be captured by a low-dimensional subspace.

Assumption 2: It is feasible to separate foreground moving objects from the scene background by fully exploiting spatio-temporal statistics.

          2. Spatio-Temporal Video Brick

Given the surveillance video of a scene, it is decomposed into a batch of small brick-like volumes. A video brick of small size (e.g., 4 × 4 × 5 pixels) contains relatively simple content and can thus be generated by a few bases (components). Because the brick volume integrates both spatial and temporal information, it captures complex appearance and motion variations better than traditional image-patch representations. Each frame Ii (i = 1, 2, …, n) is divided into a set of image patches with width w and height h. A number t of patches at the same location across consecutive frames are combined to form a brick. In this way, a sequence of video bricks V = {v1, v2, …, vn} is extracted at every location of the scene. Moreover, a descriptor is designed to describe each video brick instead of using raw RGB values. For any video brick vi, the CS-STLTP operator is applied to each pixel and all the feature values are accumulated into a histogram. For a pixel xc, a few 2D spatio-temporal planes centred at xc are constructed, and the local ternary pattern (LTP) operator is computed on each plane. The CS-STLTP then encodes xc by combining the LTP operators of all planes. Note that the way of splitting the spatio-temporal planes has little effect on the operator's performance. To simplify the implementation, we make the planes parallel to the Y axis, as shown in Fig. 2. To measure the operator response, we transform the binary vector of CS-STLTP into a uniform value defined as the number of spatial transitions (bitwise changes). For example, the pattern (i.e. the vector of 16 bins) 0000000000000000 has a value of 0 and 1000000000000000 a value of 1. In our implementation, we further quantize all possible values into 48 levels. To further improve the capability, we can generate histograms in each color channel and concatenate them. The proposed descriptor is computationally efficient and compact for describing a video brick. In addition, by introducing a tolerance range in the LTP operator computation, it is robust to local spatio-temporal noise within that range.
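To make the descriptor idea concrete, here is a simplified sketch of a center-symmetric ternary pattern on a single spatio-temporal plane, together with the transition count used as the uniform value. The neighbourhood layout, the tolerance parameter tau, and the pairing scheme are assumptions for illustration only; the actual CS-STLTP combines several planes and quantizes the values into 48 levels as described above.

```python
import numpy as np

def cs_ltp_pattern(plane, tau=0.1):
    """Center-symmetric local ternary pattern of the center pixel of a
    square spatio-temporal plane (e.g. 3 x 3 or 5 x 5).

    Each center-symmetric pixel pair (p, p') is compared: +1 if p - p'
    exceeds +tau * center, -1 if it falls below -tau * center, 0 otherwise.
    """
    plane = np.asarray(plane, dtype=float)
    center = plane[plane.shape[0] // 2, plane.shape[1] // 2]
    flat = plane.ravel()
    half = flat.size // 2
    pattern = []
    for i in range(half):                       # center-symmetric pairs
        diff = flat[i] - flat[flat.size - 1 - i]
        if diff > tau * center:
            pattern.append(1)
        elif diff < -tau * center:
            pattern.append(-1)
        else:
            pattern.append(0)
    return pattern

def uniform_value(pattern):
    """Number of value transitions along the pattern (the 'uniform' measure)."""
    return sum(1 for a, b in zip(pattern, pattern[1:]) if a != b)
```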

3. Model Definition

Let m be the descriptor length for each brick, and V = {v1, v2, …, vn}, vi ∈ R^m, be a sequence of video bricks at a certain location of the observed background. We can use a set of bases (components) C = [C1, C2, …, Cd] to represent the subspace in which V lies. Each video brick vi in V can then be represented as

vi = Σj=1…d zi,j Cj + εi,

where Cj is the jth basis (the jth column of matrix C) of the subspace, zi,j the coefficient for Cj, and εi the appearance residual. We use C to represent the appearance consistency of the sequence of video bricks. In some traditional background models based on subspace learning, zi,j is solved for and kept constant, under the assumption that the appearance of the background remains stable across the observations. In contrast, we treat zi,j as a variable term that can be further phrased as a time-varying state, accounting for temporally coherent variations (i.e. the motions).

The proposed model is time-varying, and the parameters Cn, An, and Bn can be updated incrementally as new observations are processed, in order to adapt our model to scene changes.
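As a minimal illustration of the relation vi = Σj zi,j Cj + εi above: given an appearance basis C, the coefficients and the appearance residual of a brick descriptor can be obtained by least squares. This is only a sketch of the linear representation; it does not include the time-varying state dynamics.

```python
import numpy as np

def project_onto_subspace(v, C):
    """Express a brick descriptor v (length m) in the subspace spanned by the
    columns of C (m x d): returns the coefficients z and the appearance
    residual eps such that v = C @ z + eps."""
    z, *_ = np.linalg.lstsq(C, v, rcond=None)
    eps = v - C @ z
    return z, eps
```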

      3. LEARNING ALGORITHM

        In this section, the learning for spatio-temporal background models is discussed, including initial subspace generation and online maintenance. The initial learning is performed at the beginning of system deployment, when only a few foreground objects move in the scene. Afterwards, the system switches to the mode of online maintenance.

        1. Initial Model Learning

In the initial stage, the model defined in the above equation degenerates into a non-dynamic linear system, since the n observations are extracted and fixed. Given a brick sequence V = {v1, v2, …, vn}, an algorithm to identify the model parameters Cn, An, and Bn is presented.
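The paper does not spell out the estimator here, but a common closed-form identification for such linear dynamic subspaces (as used for dynamic textures) proceeds via the SVD of the data matrix. The sketch below follows that standard recipe under the assumption that W stacks the brick descriptors column-wise; it is not necessarily the authors' exact procedure.

```python
import numpy as np

def learn_initial_model(W, d):
    """Closed-form estimate of subspace and dynamics from a data matrix
    W (m x n) whose columns are the descriptors of the observed bricks.

    Returns:
      C : m x d appearance basis (leading left singular vectors)
      A : d x d state-transition matrix, so that z_{i+1} ~ A @ z_i
      B : d x d driving-noise factor (Cholesky factor of the residual covariance)
    """
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    C = U[:, :d]                           # appearance parameters
    Z = np.diag(s[:d]) @ Vt[:d, :]         # hidden states, one column per brick
    Z0, Z1 = Z[:, :-1], Z[:, 1:]
    A = Z1 @ np.linalg.pinv(Z0)            # least-squares dynamics
    E = Z1 - A @ Z0                        # one-step state residuals
    cov = E @ E.T / max(E.shape[1] - 1, 1)
    B = np.linalg.cholesky(cov + 1e-6 * np.eye(d))
    return C, A, B
```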

        2. Online Model Maintenance

We now discuss the online processing with our model, which segments foreground moving objects and keeps the model updated.

        1. Foreground Segmentation:

Given a newly appearing video brick vn+1, we determine whether the pixels in vn+1 belong to the background or not by thresholding their appearance residual and state residual. We first estimate the state of vn+1 with the existing Cn, and then the appearance residual of vn+1.

As the state zn and the temporal coherence An have already been solved, we can then estimate the state residual according to the system equation.

With the state residual and the appearance residual εn+1 for the new video brick vn+1, we apply the following criteria for foreground segmentation, in which two thresholds are introduced (a thresholding sketch follows these criteria).

1. vn+1 is classified as background only if all dimensions of its state residual are less than a threshold T1.

2. If vn+1 has been labelled as non-background, perform pixel-wise segmentation by comparing εn+1 with a threshold T2: a pixel is segmented as foreground if its corresponding dimension in εn+1 is greater than T2.
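A minimal sketch of the two-stage test above, assuming descriptor-level residuals, a least-squares state estimate, and the two thresholds T1 and T2 named in the criteria (these names and defaults are assumptions):

```python
import numpy as np

def segment_brick(v_new, z_prev, C, A, T1, T2):
    """Classify a new brick and, if needed, its individual entries.

    v_new  : descriptor of the incoming brick v_{n+1} (length m)
    z_prev : previously estimated state z_n of this location
    C, A   : appearance basis and state-transition matrix of the local model
    Returns a boolean foreground mask over the entries of v_new.
    """
    z_new, *_ = np.linalg.lstsq(C, v_new, rcond=None)   # estimate new state
    state_res = np.abs(z_new - A @ z_prev)              # state residual
    app_res = np.abs(v_new - C @ z_new)                 # appearance residual
    if np.all(state_res < T1):                          # criterion 1: background brick
        return np.zeros(v_new.shape, dtype=bool)
    return app_res > T2                                 # criterion 2: pixel-wise test
```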

        2. Model Updating:

During online processing, the key problem for model updating is dealing with foreground disturbance, i.e. avoiding absorbing pixels from foreground objects or noise. In this work, we develop an effective approach that updates the model with synthesized data. We first generate a video brick from the current model, namely the noise-free brick, and then extract pixels from it to compensate the occluded (i.e. foreground) pixels in the newly appearing brick. Concretely, the pixels labelled as non-background are replaced by the pixels from the noise-free video brick at the same location. We thereby obtain a synthesized video brick for model updating (a compensation sketch is given below).

Given this synthesized brick, the data matrix Wn composed of the observed video bricks is extended to Wn+1. Then, the model Cn+1 is updated.
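The compensation step can be sketched as follows, assuming the model-generated brick is obtained by projecting the predicted state back through the appearance basis (variable names are illustrative, not the paper's notation):

```python
import numpy as np

def synthesize_and_extend(v_new, fg_mask, z_pred, C, W):
    """Compensate foreground-occluded entries of the new brick and extend
    the data matrix for model updating.

    v_new   : observed brick descriptor (length m)
    fg_mask : boolean mask of entries labelled as foreground or noise
    z_pred  : state predicted by the dynamics (e.g. A @ z_prev)
    C       : current appearance basis (m x d)
    W       : current data matrix W_n (m x n)
    """
    v_free = C @ z_pred                        # noise-free brick from the current model
    v_syn = np.where(fg_mask, v_free, v_new)   # replace occluded entries
    W_ext = np.column_stack([W, v_syn])        # W_n -> W_{n+1}
    return v_syn, W_ext
```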

      4. EXPERIMENTS

To perform the simulation, we collected a number of challenging videos to validate our approach. Our algorithm has shown good results in adapting to complex background changes. The full evaluation of our work will be presented after the complete implementation.

      5. OUTPUT

Fig. 3: Sampled results of background subtraction generated by our approach

The algorithm for the proposed model is given below.

Algorithm 1: The Sketch of the Proposed Algorithm

Input: Video brick sequence V = {v1, v2, …} for every location of the scene.

Output: Maintained background models and foreground regions.

for all locations of the scene do

Given the observed video bricks V, extract the CS-STLTP descriptors;

Initialize the subspace by estimating Cn, An, Bn;

for each newly appearing video brick vn+1 do

1. Extract the CS-STLTP descriptor for vn+1;

2. Calculate its state residual and appearance residual;

3. For each pixel of vn+1, classify it as foreground or background by thresholding the two residuals;

4. Generate the noise-free brick from the current model;

5. Synthesize the video brick for model updating;

6. Update the data matrix Wn to Wn+1 with the synthesized brick, introducing a robustness function;

end

7. Update the new appearance parameters Cn+1 by calculating the covariance matrix Covn+1 with the learning rate;

8. Update the state variation parameters An+1, Bn+1;

end
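Putting the pieces together, the following illustrative driver mirrors Algorithm 1 for a single location. It reuses the helper functions sketched in the previous sections (learn_initial_model, segment_brick, synthesize_and_extend) and substitutes a simple batch re-fit for the paper's incremental update and robustness function, so it should be read as a rough outline under assumed parameter values, not as the authors' implementation.

```python
import numpy as np

def run_location(brick_seq, n_init=50, d=5, T1=1.0, T2=10.0):
    """Per-location processing loop following the sketch of Algorithm 1.

    brick_seq : iterable of brick descriptors (each a length-m vector);
                a real system would apply the CS-STLTP descriptor first.
    Thresholds, subspace dimension, and the re-fit update are assumptions.
    """
    bricks = [np.asarray(b, dtype=float).ravel() for b in brick_seq]
    W = np.column_stack(bricks[:n_init])
    C, A, B = learn_initial_model(W, d)                  # initialize the subspace
    z = np.linalg.lstsq(C, bricks[n_init - 1], rcond=None)[0]
    masks = []
    for v in bricks[n_init:]:
        fg = segment_brick(v, z, C, A, T1, T2)           # residuals + thresholds
        masks.append(fg)
        _, W = synthesize_and_extend(v, fg, A @ z, C, W)  # compensate + extend W
        C, A, B = learn_initial_model(W, d)              # refresh parameters
        z = np.linalg.lstsq(C, v, rcond=None)[0]         # carry the state forward
    return masks
```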

      6. CONCLUSION

Using the above algorithm, we obtained the results shown in Fig. 3. The work is still in progress, and further work will apply edge detection and a smoothing filter along with the spatio-temporal method to obtain a cleaner foreground object detection.

