Open Source Python Multimedia Software

Browse free open source Python Multimedia Software and projects below. Use the toggles on the left to filter open source Python Multimedia Software by OS, license, language, programming language, and project status.

  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • Automate contact and company data extraction Icon
    Automate contact and company data extraction

    Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.

    Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
    Explore Apify Store
  • 1
    OpenCV

    OpenCV

    Open Source Computer Vision Library

    The Open Source Computer Vision Library has >2500 algorithms, extensive documentation and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, iOS in your browser through JavaScript. Languages: C++, Python, Julia, Javascript Homepage: https://opencv.org Q&A forum: https://forum.opencv.org/ Documentation: https://docs.opencv.org Source code: https://github.com/opencv Please pay special attention to our tutorials! https://docs.opencv.org/master Books about the OpenCV are described here: https://opencv.org/books.html
    Leader badge
    Downloads: 3,480 This Week
    Last Update:
    See Project
  • 2
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    Intention Repeater MAX

    Intention Repeater MAX

    Repeating your Intentions to aid in manifestation

    Please see the README.txt. The ServitorConnect 4443 and Python Daemon and Intention Repeater Android are better because repeating once-per-hour is better than millions of times per second (or even 3Hz). The archive bundle includes binaries and source code for: MAX and Simple Intention Repeaters CUDA version for Windows/Linux Memory Frequency Generator Multi-Format to WAV Repeater Android app Sourcecode File/Image Writers Nesting Files Creator Prayer Wheel Spiritual Chat Tarot WiFi Broadcast Whether you're seeking to manifest abundance, enhance your spiritual journey, or promote overall well-being, Intention Repeater MAX empowers you to harness the incredible strength of intention repetition. Compatible with Windows, macOS, and Linux. Download now and experience the life-changing benefits for yourself! 🙌
    Downloads: 19 This Week
    Last Update:
    See Project
  • 4
    GUIDOLib
    The GUIDOLib provides a powerful engine for the graphic rendering of music scores, based on the Guido Music Notation format. It supports Linux, Mac OS X, Windows, Android and iOS operating systems. A Java JNI interface is available as well as a Javascript version of the library. A Web API has also been designed, allowing to deploy the engine as a Web service.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Deliver trusted data with dbt Icon
    Deliver trusted data with dbt

    dbt Labs empowers data teams to build reliable, governed data pipelines—accelerating analytics and AI initiatives with speed and confidence.

    Data teams use dbt to codify business logic and make it accessible to the entire organization—for use in reporting, ML modeling, and operational workflows.
    Learn More
  • 5

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud Features: Cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian) GPU/CPU Encode/Decode/Post Processing Stream statistics CCTV Adaptive hls streams Load balancing Temporary urls HLS push EPG scanning Subtitles to text conversions AD insertion Logo overlay Video effects Relays Timeshifts Catchups Playlists Restream/Transcode from online streaming services like Youtube, Twitch Mozaic Many Outputs Physical Inputs Streaming Protocols File Formats Presets Vods/Series server-side support Pay per view channels Channels on demand HTTP Live Streaming (HLS) server-side support Public API, client server communication via JSON RPC Protocol gzip compression Deep learning video analysis Supported deep learning frameworks: Tensorflow NCSDK Caffe ML Hardware:
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6

    ComfortTV

    Video add-on for Kodi

    Add-on for Kodi allowing to view video content provided by Garant TV Gomel
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    CIOS Audio Core

    CIOS Audio Core

    A cross platform audio inteface

    CIOS Audio Core is the infrastructure and subsystem in Collective Intelligence Operation System. It is a cross platforms audio abstract interface, supports including Mac OS X, Linux, Android, iPhone OS and Windows 7.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    EnCodec

    EnCodec

    State-of-the-art deep learning based audio codec

    Encodec is a neural audio codec developed by Meta for high-fidelity, low-bitrate audio compression using end-to-end deep learning. Unlike traditional codecs (like MP3 or Opus), Encodec uses a learned quantizer and decoder to reconstruct complex waveforms with remarkable accuracy at bitrates as low as 1.5 kbps. It employs a convolutional encoder–decoder architecture trained with perceptual loss functions that optimize for human auditory quality rather than raw waveform distance. The model can operate in real time and supports variable bandwidths, bitrates, and multi-band audio. Encodec has applications in speech and music compression, generative modeling, and efficient data transmission for communication systems. The repository includes pretrained checkpoints, PyTorch inference code, and examples for integrating Encodec as a module in downstream generative or streaming systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Mollino

    Mollino

    Not your usual Architectural Modeler

    Why do Archicad and Revit only share 30 or 40% of the global architectural sofware market ? Maybe because lots of small practices find these software unadapted to the way they work, and to their bank account. I think there's place for an open software that doesn't work like the existing. It will be simple in its tools and processes, and will have infinite capabilities. This will be possible by defining a logical structure. As I am an architect, and I know very little about programming, and wouldn't reach even in 10 years the necessary level to be able to write anything useful for this type of software, my part will be to bring ideas and coherence to this project. If you want to know some more : - go to the blog https://sourceforge.net/p/mollino/blog/ - write me a message ;) I speak french, german, english and italian. Martin Lucas
    Downloads: 0 This Week
    Last Update:
    See Project
  • Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software. Icon
    Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.

    Banks, lending institutions

    Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
    Learn More
  • 10
    MusiqBox

    MusiqBox

    An social musicsystem for your Pi and smartphone.

    MusiqBox is an client-server based audio player, which let you stream music files from any device to an server and plays it. You are also able to store music on the server itself and manage it with your smartphone or desktop. The aim of MusiqBox is to extend the functionality of an existing audio-system (e.g. common stereo) with this features: -Playback of common audio-formats like: mp3, ogg, flac, wave,… -Stream music files over the network -Store music files locally -Remote control -Playlist support -…
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    PyShelf

    PyShelf

    FOSS Ebook Server, With no windowing requirements

    PyShelf is an Open Source python based, ebook server, that does not and never will require a windowing system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    SMPTE 2022-1 library for VLC & co

    Optimized and cross platform SMPTE 2022 FEC library in C, Python, Java

    Project moved to GitHub. https://github.com/davidfischer-ch/smpte2022lib
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    SmartBody

    Character animation system for games and simulations.

    SmartBody is available for download for Windows, Linux and OSX users. SmartBody can also be used on Android and iOS platforms. SmartBody is a character animation platform that provides the following capabilities in real time: * Locomotion (walk, jog, run, turn, strafe, jump, etc.) * Steering - avoiding obstacles and moving objects * Object manipulation - reach, grasp, touch , pick up objects * Lip Syncing - characters can speak with simultaneous lip-sync using text-to-speech or prerecorded audio * Gazing - robust gazing behavior that incorporates various parts of the body * Nonverbal behavior - gesturing, head nodding and shaking, eye saccades - Online and offline retargeting of motion - Automatic skinning and rigging SmartBody is written in C++ and can be incorporated into most game engines. SmartBody is developed at the USC Institute for Creative Technologies from the Character Animation and Simulation group: http://cas.ict.usc.edu
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    XBMC

    XBMC

    A media player software and entertainment system application framework

    XBMC Media Center is a free cross-platform media player software and entertainment system application framework for Linux, Mac, and Windows. XBMC is capable of playing back and streaming all popular video, audio, and picture formats out-of-the-box.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next