EVERYTHING AI FREE DOWNLOADS
DICE DREAM DIFFUSION
投稿日 Oct 16, 2024
EVERYTHING AI FREE DOWNLOADS - By DICE
I have compiled a list of the best free to use or free to install AI tools available for 2024. This list will be updated every fortnight with new releases for you to try and test.
Below you will find every tool needed to help with your projects and work. If you are a Developer of tools or services please feel free to contact me and get yourself on the list....
I hope you find these helpful and please leave feedback on how you found they run, so as other users can see if the tool or service suits there needs to.
Thanks guys and enjoy .... DIce
SCRIPT VERSION 1.5
Face Fusion 3.0.0
Next generation face swapper and enhancer
SCRIPT VERSION 1.5
Hallo
[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
SCRIPT VERSION 1.5
Flash Diffusion
Accelerating any conditional diffusion model for few steps image generation
SCRIPT VERSION 1.5
Chat-With-Mlx
[Mac Onlyl] An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.
PCM
Phased Consistency Model - generate high quality images with 2 steps
SCRIPT VERSION 1.5
Stable Audio
An Open Source Model for Audio Samples and Sound Design
SCRIPT VERSION 1.5
SillyTavern
A local-install interface that allows you to interact with text
generation AIs (LLMs) to chat and roleplay with custom characters.
SCRIPT VERSION 1.5
AITown
Build and customize your own version of AI town - a virtual town where AI characters live, chat and socialize
Augmentoolkit
Turn any raw text into a high-quality dataset for AI finetuning
LoRA the Explorer
Stable Diffusion LoRA Playground HuggingFace:
lavie
Text-to-Video (T2V) generation framework from Vchitect
SCRIPT VERSION 1.3
Dust3r
Geometric 3D Vision Made Easy
SCRIPT VERSION 1.5
LlamaFactory
Unify Efficient Fine-Tuning of 100+ LLMs
SCRIPT VERSION 1.5
Invoke
The Gen AI Platform for Pro Studios
SCRIPT VERSION 1.5
Openui
Describe UI and see it rendered live. Ask for changes and convert HTML to React,
Svelte, Web Components, etc. Like vercel v0, but open source
XTTS
clone voices into different languages by using just a quick 3-second audio clip. (a local version of
RVC
1 Click Installer for Retrieval-based-Voice-Conversion-WebUI
LCM
Fast Image generator using Latent consistency models
SCRIPT VERSION 1.3
Whisper-WebUI
A Web UI for easy subtitle using whisper model
Realtime BakLLaVA
llama.cpp with BakLLaVA model describes what does it see
Realtime StableDiffusion
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server
SCRIPT VERSION 1
StreamDiffusion
[NVIDIA ONLY] A Pipeline-Level Solution for Real-Time Interactive Generation
SCRIPT VERSION 1
Moore-AnimateAnyone
[NVIDIA GPU ONLY] Unofficial Implementation of Animate Anyone
SCRIPT VERSION 1
Moore-AnimateAnyone-Mini
[NVIDIA ONLY] Efficient Implementation of Animate Anyone (13G VRAM + 2G model size)
SCRIPT VERSION 1
PhotoMaker
Customizing Realistic Human Photos via Stacked ID Embedding
SCRIPT VERSION 1.1
BRIA RMBG
Background removal model developed by BRIA.AI, trained on a carefully selected dataset and is available as an open-source model for non-commercial use
SCRIPT VERSION 1.2
Gligen
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
SCRIPT VERSION 1.2
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean
Chatbot-Ollama
open source chat UI for Ollama
SCRIPT VERSION 1.2
Differential-diffusion-ui
Differential Diffusion modifies an image according to a text prompt, and according
to a map that specifies the amount of change in each region
SCRIPT VERSION 1.2
Supir
[NVIDIA ONLY] Text-driven, intelligent restoration, blending AI technology with creativity to give every image a brand new life
SCRIPT VERSION 1.5
ZeST
ZeST: Zero-Shot Material Transfer from a Single Image. Local port of
SCRIPT VERSION 1.5
StoryDiffusion Comics
create a story by generating consistent images
SCRIPT VERSION 1.2
Lobe Chat
An open-source, modern-design ChatGPT/LLMs UI/Framework. Supports
speech-synthesis, multi-modal, and extensible (function call) plugin
system.
SCRIPT VERSION 1.5
Parler-tts
A lightweight text-to-speech (TTS) model that can generate high-quality
speech with features that can be controlled using a simple text prompt
(e.g. gender, background noise, speaking rate, pitch and reverberation).
SCRIPT VERSION 1.5
Instantstyle
Upload the picture of an image, and generate images with that image style. Instant generation with no LoRA required
SCRIPT VERSION 1.5
Openvoice2
Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS
SCRIPT VERSION 1.5
IDM-VTON
Improving Diffusion Models for Authentic Virtual Try-on in the Wild
SCRIPT VERSION 1.5
Devika
Agentic AI Software Engineer
SCRIPT VERSION 1.2
Open WebUI
User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs
SCRIPT VERSION 1.5
CosXL
Edit images with just prompt, an unofficial demo for CosXL and CosXL Edit from Stability AI,
SCRIPT VERSION 1.5
Face-to-all
Diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a localized Version of
SCRIPT VERSION 1.5
CustomNet
A unified encoder-based framework for object customization in text-to-image diffusion models
SCRIPT VERSION 1.5
Brushnet
A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
SCRIPT VERSION 1.5
Arc2Face
A Foundation Model of Human Faces
SCRIPT VERSION 1.2
TripoSR
A state-of-the-art open-source model for fast feedforward 3D
reconstruction from a single image, developed in collaboration between
Tripo AI and Stability AI.
SCRIPT VERSION 1.2
ZETA
Zero-Shot Text-Based Audio Editing Using DDPM Inversion
SCRIPT VERSION 1.2
Remove-video-bg
Video background removal tool
SCRIPT VERSION 1.1
[NVIDIA GPU ONLY] LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation
SCRIPT VERSION 1
vid2pose
Video to Openpose & DWPose (All OS supported)
SCRIPT VERSION 1
IP-Adapter-FaceID
Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model
SCRIPT VERSION 1
Dreamtalk
When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
SCRIPT VERSION 1
Video2Openpose
Turn any video into Openpose video
MagicAnimate Mini
[NVIDIA GPU Only] An optimized version of MagicAnimate
MagicAnimate
[NVIDIA GPU Only] Temporally Consistent Human Image Animation using Diffusion Model
AudioSep
Separate Anything You Describe
Tokenflow
Temporally consistent video editing. A local version of
ModelScope Image2Video (Nvidia GPU only)
Turn any image into a video! (Web UI created by fffiloni:
Text Generation WebUI
A Gradio web UI for Large Language Models
SCRIPT VERSION 1
MAGNeT
A text-to-music and text-to-sound model capable of generating
high-quality audio samples conditioned on text descriptions
SCRIPT VERSION 1
VideoCrafter 2
[Runs fast on NVIDIA GPUs. Works on M1/M2/M3 Macs but slow] VideoCrafter is
an open-source video generation and editing toolbox for crafting video
content. It currently includes the Text2Video and Image2Video models
SCRIPT VERSION 1.1
Bark Voice Cloning
Upload a clean 20 seconds WAV file of the vocal persona you want to mimic,
type your text-to-speech prompt and hit submit! A local version of