Anticipation of Forged Video Evidence using Machine Learning
G. Abinaya1, K. Sridevi Nattar2, Dr. Rajini Girinath3
1,2UG Student, 3Professor
Department of Computer Science and Engineering,
Sri Muthukumaran Institute of Technology, Chennai, India
ABSTRACT
This work aims to detect audio manipulation in pre-recorded evidence videos by developing a synchronization verification algorithm that matches the lip movements against the audio pitch values. Audio-video recognition has been considered key for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker authentication in multi-speaker scenarios. The primary aim of this paper is to identify the correspondence between the audio and video streams. Acquired audio feature sequences are processed with a Gaussian model [1]. The proposed method achieves parallel processing by examining multiple videos at a time. In this paper, we train the machine using a convolutional neural network (CNN) and a deep neural network (DNN). The CNN architecture maps both modalities into a representation space to evaluate the correspondence of the audio-visual streams using the learned multimodal features. The DNN is used as a discriminative model between the two modalities in order to concurrently distinguish between the correlated and uncorrelated components. The proposed architecture deploys both spatial and temporal information jointly to effectively discover the correlation between temporal information for different modalities. We train the system by capturing the motion picture. This method achieves a relative improvement of over 20% on the equal error rate and 7% on the average precision in comparison to the state-of-the-art method.

KEYWORDS: Convolutional neural network, Audio-visual recognition, Deep neural network

I. INTRODUCTION
In this digital era, with highly sophisticated video editing tools, forged videos can be submitted as evidence. Cyber forensics is a specialized field in which the synchronization between lip movement and speech audio is verified manually. Submitting a manipulated video is a serious offence, which may bias the result of a judgment. The purpose of Audio Video Recognition (AVR) systems is to leverage the information extracted from one modality to improve the recognition ability of the other modality by complementing the missing data. Audio-video speech recognition is a method that uses image processing capabilities in lip reading to assist speech recognition systems in recognizing non-deterministic phones or in deciding among hypotheses of nearly equal probability. The hardest part of an AVR algorithm is the feature selection for both the audio and visual modalities, which in turn directly impacts the AVR task. The lip reading and speech recognition modules work individually, and their results are merged at the feature fusion stage.

In the area of machine learning, deep learning approaches have recently attracted increasing attention because deep neural networks can effectively extract robust latent features that enable various recognition algorithms to show revolutionary generalization capabilities under diverse application circumstances. For the speech modality, speech recognition systems use Hidden Markov Models (HMMs) to extract the temporal information of speech and Gaussian Mixture Models (GMMs) to discriminate between different HMM states for acoustic input representation. A neural training algorithm is used to automate the human intelligence
@ IJTSRD | Available Online @ www.ijtsrd.com | Volume – 2 | Issue – 3 | Mar-Apr 2018 Page: 1430
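The abstract reports its results in terms of equal error rate (EER). As a rough illustration only, the sketch below shows one common way to estimate EER from two lists of similarity scores: scores for genuinely synchronized audio-video pairs and scores for mismatched pairs. The function names, the threshold sweep, and the toy scores are assumptions made for this example; they are not taken from the paper and do not reproduce its numbers.

```python
def equal_error_rate(genuine_scores, impostor_scores):
    """Estimate the EER by sweeping a threshold over all observed scores.

    genuine_scores: scores for matching (in-sync) audio-video pairs.
    impostor_scores: scores for non-matching (out-of-sync) pairs.
    Higher scores are assumed to mean "more likely in sync".
    Returns (eer, threshold) at the point where the false acceptance
    and false rejection rates are closest to each other.
    """
    thresholds = sorted(set(genuine_scores) | set(impostor_scores))
    best = None
    for t in thresholds:
        # False rejection: a genuine pair scored below the threshold.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        # False acceptance: an impostor pair scored at or above it.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


def relative_improvement(baseline_error, new_error):
    """Relative reduction of an error metric with respect to a baseline."""
    return (baseline_error - new_error) / baseline_error


if __name__ == "__main__":
    # Hypothetical scores, chosen only to exercise the function.
    genuine = [0.9, 0.8, 0.7, 0.4]
    impostor = [0.6, 0.3, 0.2, 0.1]
    eer, threshold = equal_error_rate(genuine, impostor)
    print("EER:", eer, "at threshold", threshold)
```

A "relative" improvement on the EER, as the abstract phrases it, is a reduction measured against the baseline value, which is what the hypothetical `relative_improvement` helper computes.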
International Journal of Trend in Scientific Research and Development (IJTSRD) ISSN: 2456-6470
than the angle which is being trained, the machine itself will add or subtract 90° from the video.

parameters of an autoregressive model of the signal [4]
Machine learning is an interdisciplinary field involving programs that improve with experience. It is well suited to problems such as pattern recognition, object extraction and colour classification in the image processing domain. Machine learning constructs computer programs that develop solutions and improve with experience. It solves problems which cannot be solved by enumerative methods or calculus-based techniques. Machine learning is a part of Artificial Intelligence. Machines are trained with data sets. The objective of machine learning is to develop computer algorithms which can acquire experience from example inputs and make data-driven estimates on unknown test data. Such algorithms can be divided into two types, supervised and unsupervised learning, which are complementary to each other. Since supervised learning requires input labels that are meaningful to humans, it is easy to apply this kind of learning algorithm to pattern classification and data regression problems.

Fig4: Flow of video
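The difference between supervised and unsupervised learning described above can be illustrated with a minimal sketch. It is not the method used in this paper: a nearest-centroid classifier stands in for supervised learning (it needs labels), and a plain k-means loop stands in for unsupervised learning (it receives no labels). All function names and the toy 2-D points are assumptions made for illustration.

```python
import random


def dist2(a, b):
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def mean(points):
    """Component-wise mean of a non-empty list of 2-D points."""
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def nearest_centroid_fit(points, labels):
    """Supervised: the human-provided labels decide which points form a class."""
    groups = {}
    for point, label in zip(points, labels):
        groups.setdefault(label, []).append(point)
    return {label: mean(members) for label, members in groups.items()}


def nearest_centroid_predict(centroids, point):
    """Assign a new point to the class with the closest centroid."""
    return min(centroids, key=lambda label: dist2(centroids[label], point))


def kmeans(points, k, iterations=20, seed=0):
    """Unsupervised: no labels are given; points are grouped by proximity."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            idx = min(range(k), key=lambda i: dist2(centroids[i], p))
            clusters[idx].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [mean(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    assignments = [min(range(k), key=lambda i: dist2(centroids[i], p))
                   for p in points]
    return centroids, assignments


if __name__ == "__main__":
    # Toy data: two well-separated groups of points (hypothetical).
    points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    labels = ["a", "a", "a", "b", "b", "b"]

    model = nearest_centroid_fit(points, labels)
    print(nearest_centroid_predict(model, (0.5, 0.5)))

    _, assignments = kmeans(points, k=2)
    print(assignments)
```

The supervised model returns one of the human-readable labels it was trained with, while k-means only returns anonymous cluster indices; mapping those indices to meaningful categories is left to the user.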