A Visual Computing Unified Application Using Deep Learning and Computer Vision Techniques

Authors

  • Sowmya B. J. M S Ramaiah Institute of Technology
  • Meeradevi Ramaiah Institute of Technology, Bengaluru, India
  • S. Seema Professor in Computer Science and Engineering Department, M S Ramaiah Institute of Technology, Bangalore, India
  • Dayananda P Department of Information Technology, Manipal Institute of Technology Bengaluru, Manipal Academy of Higher Education, Manipal, India https://orcid.org/0000-0001-8445-3469
  • Supreeth S. REVA University https://orcid.org/0000-0002-7097-6733
  • Shruthi G. REVA University https://orcid.org/0000-0002-8185-8134
  • S. Rohith Department of Electronics & Communication Engineering, Nagarjuna College of Engineering & Technology, Bengaluru, India https://orcid.org/0000-0001-5709-9492

DOI:

https://doi.org/10.3991/ijim.v18i01.42673

Keywords:

Deep Learning, Convolution Neural Networks, Computer Vision Techniques, Visual Computing

Abstract


Vision Studio aims to utilize a diverse range of modern deep learning and computer vision principles and techniques to provide a broad array of functionalities in image and video processing. Deep learning is a distinct class of machine learning algorithms that utilize multiple layers to gradually extract more advanced features from raw input. This is beneficial when using a matrix as input for pixels in a photo or frames in a video. Computer vision is a field of artificial intelligence that teaches computers to interpret and comprehend the visual domain. The main functions implemented include deepfake creation, digital ageing (de-ageing), image animation, and deepfake detection. Deepfake creation allows users to utilize deep learning methods, particularly autoencoders, to overlay source images onto a target video. This creates a video of the source person imitating or saying things that the target person does. Digital aging utilizes generative adversarial networks (GANs) to digitally simulate the aging process of an individual. Image animation utilizes first-order motion models to create highly realistic animations from a source image and driving video. Deepfake detection is achieved by using advanced and highly efficient convolutional neural networks (CNNs), primarily employing the EfficientNet family of models.

Downloads

Published

2024-01-12

How to Cite

J., S. B., Meeradevi, Seema, S., P, D., S., S., G., S., & Rohith, S. (2024). A Visual Computing Unified Application Using Deep Learning and Computer Vision Techniques. International Journal of Interactive Mobile Technologies (iJIM), 18(01), pp. 59–74. https://doi.org/10.3991/ijim.v18i01.42673

Issue

Section

Papers