Audio tampering detection: Deep learning methodologies for multi-layered threat detection

Dörömbözi, András

doi:10.34726/hss.2023.96575

DC Field

Value

Language

dc.contributor.advisor

Knees, Peter

dc.contributor.author

Dörömbözi, András

dc.date.accessioned

2023-09-21T10:24:37Z

dc.date.issued

2023

dc.date.submitted

2023-09

dc.identifier.citation

Dörömbözi, A. (2023). Audio tampering detection: Deep learning methodologies for multi-layered threat detection [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2023.96575

dc.identifier.uri

https://doi.org/10.34726/hss.2023.96575

dc.identifier.uri

http://hdl.handle.net/20.500.12708/188402

dc.description.abstract

The thesis proposes and evaluates deep learning methodologies for detecting tampering in multi-layered audio samples. Audio recordings can be tampered with by cutting, insertion, or reshuffling. State-of-the-art solutions show promising results in detecting tampering events based on acoustic environment analysis, microphone identification, or analysis of the trace fluctuation of the signal in certain frequency ranges. However, these solutions are rarely evaluated on recordings that have been post-processed with additional layers, even though such a layer might hide the tampering traces. This can be problematic: if, for example, a tampered speech by a politician is post-processed with a music layer, the tampering might not be detected in time. Such content could harm the trustworthiness of democratic institutions if it reaches many people, which, given the rapid growth of social media platforms, is already a realistic scenario. The methodologies proposed in this thesis rely on Transformer models, Multi-Layer Perceptrons, and Recurrent Neural Networks. In addition to the proposed methods, a baseline tampering detection approach introduced in a state-of-the-art research paper is implemented and evaluated against the multi-layered query audios. This evaluation shows how the performance of the baseline model is affected by additional post-processing techniques. The performance of the proposed methodologies is compared with that of the baseline approach to determine in which cases the proposed methodologies provide a better solution and to identify their disadvantages and bottlenecks. The thesis demonstrates that the proposed approaches can outperform the baseline model when additional music or environment layers are applied to the recordings. On the other hand, the baseline model noticeably outperforms the proposed methodologies in cases where conditions are optimal for capturing the ENF (electric network frequency) signal, on which the baseline model's feature extraction relies, from the recordings.

dc.language

English

dc.language.iso

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

Audio Tampering Detection

dc.subject

Supervised Learning

dc.subject

Deep Neural Networks

dc.subject

Audio-Signal Processing

dc.subject

Dataset creation

dc.subject

Multi-layer audio

dc.title

Audio tampering detection: Deep learning methodologies for multi-layered threat detection

dc.type

Thesis

dc.type

Hochschulschrift

dc.rights.license

In Copyright

dc.rights.license

Urheberrechtsschutz

dc.identifier.doi

10.34726/hss.2023.96575

dc.contributor.affiliation

TU Wien, Österreich

dc.rights.holder

András Dörömbözi

dc.publisher.place

Wien

tuw.version

vor

tuw.thesisinformation

Technische Universität Wien

dc.contributor.assistant

Schindler, Alexander

tuw.publication.orgunit

E194 - Institut für Information Systems Engineering

dc.type.qualificationlevel

Diploma

dc.identifier.libraryid

AC16948814

dc.description.numberOfPages

118

dc.thesistype

Diplomarbeit

dc.thesistype

Diploma Thesis

dc.rights.identifier

In Copyright

dc.rights.identifier

Urheberrechtsschutz

tuw.advisor.staffStatus

staff

tuw.assistant.staffStatus

staff

tuw.advisor.orcid

0000-0003-3906-1292

item.languageiso639-1

item.openairetype

master thesis

item.grantfulltext

open

item.fulltext

with Fulltext

item.cerifentitytype

Publications

item.mimetype

application/pdf

item.openairecristype

http://purl.org/coar/resource_type/c_bdcc

item.openaccessfulltext

Open Access

Appears in Collections:

Thesis

Fulltext (Version of Record (published version))

Adobe PDF

(2.88 MB)

In Copyright

Page view(s)

343

checked on Nov 23, 2023

Download(s)

236

checked on Nov 23, 2023
