# SHORT-VIDEO-CREATOR-SIMPLIFIED

## Table of Contents

- Introduction
- Project Overview
- Features
- Prerequisites
- Installation
- Configuration
- Usage
- Project Structure
- Architecture
- API Integrations
- Output Format
- Testing
- Troubleshooting
- Contributing
- License
## Introduction

SHORT-VIDEO-CREATOR-SIMPLIFIED is a Node.js application that streamlines content creation for short-form videos. By harnessing the capabilities of various AI services, it automates the generation of engaging scripts, lifelike voice narrations, compelling images, background music, and videos, producing a comprehensive package of content ready for final video editing.
## Project Overview

This project aims to streamline the content creation pipeline by integrating several cutting-edge AI services:
- Language Model (LLM) for dynamic script generation
- Voice Generation (Elevenlabs) for natural-sounding narration
- Image Creation (Midjourney) for visually stunning scenes
- Music Generation (Suno) for custom background tracks
- Video Generation (Immersity AI) for creating video content from images
The system processes input from a CSV file containing multiple prompts, leverages these AI services, and outputs a structured set of files primed for import into video editing software such as Capcut, significantly reducing the time and effort required in the content creation process.
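For reference, the input CSV simply lists one prompt per line. The prompts below are made-up examples, not part of the project:

```csv
A 60-second explainer on why the sky is blue
Three surprising facts about deep-sea creatures
A motivational short about building a morning routine
```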
## Features

- Efficient CSV input processing for batch content creation with multiple prompts
- AI-powered script generation using advanced GPT models
- Realistic voice narration synthesis using Elevenlabs
- AI-generated images using Midjourney
- Custom background music generation using Suno AI
- Video generation from images using Immersity AI
- Structured output optimized for video editing workflows
- Highly configurable pipeline to suit various content needs
- Robust error handling and comprehensive logging
- Separate test environment for LLM, voice generation, image generation, music generation, and video generation services
- Integration test for end-to-end workflow verification
## Prerequisites

- Node.js (v14.0.0 or later)
- npm (v6.0.0 or later)
- API keys and authentication for the following services:
- OpenAI (GPT) for script generation
- Elevenlabs for voice synthesis
- Midjourney for image generation
- Suno for music generation
- Immersity AI for video generation
## Installation

- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/SHORT-VIDEO-CREATOR-SIMPLIFIED.git
  cd SHORT-VIDEO-CREATOR-SIMPLIFIED
  ```

- Install Node.js dependencies:

  ```bash
  npm install
  ```
## Configuration

- Copy `config/default.example.json` to `config/default.json`
- Edit `config/default.json` and add your API keys and other settings (a sketch of how these values are read follows the example):

  ```json
  {
    "llm": {
      "provider": "openai",
      "model": "gpt-4o-2024-08-06",
      "apiKey": "YOUR_OPENAI_API_KEY"
    },
    "voiceGen": {
      "provider": "elevenlabs",
      "apiKey": "YOUR_ELEVENLABS_API_KEY"
    },
    "imageGen": {
      "provider": "midjourney",
      "serverId": "YOUR_DISCORD_SERVER_ID",
      "channelId": "YOUR_DISCORD_CHANNEL_ID",
      "salaiToken": "YOUR_DISCORD_TOKEN",
      "debug": true,
      "ws": true
    },
    "audioGen": {
      "provider": "suno",
      "sunoCookie": "YOUR_SUNO_COOKIE_HERE",
      "sessionId": "YOUR_SUNO_SESSION_ID_HERE"
    },
    "videoGen": {
      "provider": "immersityAI",
      "clientId": "YOUR_IMMERSITY_CLIENT_ID",
      "clientSecret": "YOUR_IMMERSITY_CLIENT_SECRET"
    },
    "input": {
      "csvPath": "./data/input/input.csv"
    },
    "output": {
      "directory": "./data/output"
    }
  }
  ```
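The `config/default.json` layout matches the convention of the widely used `config` npm package. Assuming that package (the project's own loader lives in `src/utils/config.js` and may expose a different API), values could be read like this minimal sketch:

```javascript
// Minimal sketch, assuming the "config" npm package; the project's actual
// loader (src/utils/config.js) may differ.
const config = require('config');

const model = config.get('llm.model');       // "gpt-4o-2024-08-06"
const csvPath = config.get('input.csvPath'); // "./data/input/input.csv"

console.log(`Script model: ${model}, prompts from: ${csvPath}`);
```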
## Usage

- Prepare your input CSV file in `data/input/input.csv` with multiple prompts, one per line
- Set up your parameters in `data/input/parameters.json` (a hypothetical example follows this list)
- Customize the initial prompt in `data/input/initial_prompt.txt`
- Run the application:

  ```bash
  npm start
  ```

- Find the generated content in the `data/output` directory
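The schema of `parameters.json` is defined by the pipeline itself; the example below is purely hypothetical and only illustrates the kind of settings such a file might hold:

```json
{
  "scenesPerPrompt": 3,
  "voiceId": "YOUR_ELEVENLABS_VOICE_ID",
  "musicStyle": "uplifting electronic",
  "aspectRatio": "9:16"
}
```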
## Project Structure

```
SHORT-VIDEO-CREATOR-SIMPLIFIED/
├── config/
│   ├── default.example.json
│   └── default.json
├── data/
│   ├── input/
│   │   ├── initial_prompt.txt
│   │   ├── input.csv
│   │   └── parameters.json
│   └── output/
├── logs/
├── src/
│   ├── services/
│   │   ├── llm-service.js
│   │   ├── voice-gen-service.js
│   │   ├── image-gen-service.js
│   │   ├── music-gen-service.js
│   │   ├── video-gen-service.js
│   │   └── suno_auth.js
│   ├── utils/
│   │   ├── config.js
│   │   ├── logger.js
│   │   ├── prompt-utils.js
│   │   └── audio-utils.js
│   ├── workflows/
│   │   └── content-pipeline.js
│   ├── models.js
│   └── index.js
├── tests/
│   ├── llm-test.js
│   ├── voice-gen-test.js
│   ├── image-gen-test.js
│   ├── music-gen-test.js
│   ├── video-gen-test.js
│   ├── integration-test.js
│   └── test_output/
│       ├── llm/
│       ├── voice/
│       ├── image/
│       ├── music/
│       ├── video/
│       └── integration/
├── .gitignore
├── package.json
└── README.md
```
## Architecture

The application follows a modular architecture designed for flexibility and maintainability (a sketch of the orchestration follows the list):
- Input Processing: Parses CSV input with multiple prompts and loads configuration parameters
- LLM Service: Generates dynamic script content based on input and parameters
- Voice Generation Service: Synthesizes natural-sounding narration from the generated script
- Image Generation Service: Creates visual content based on scene descriptions
- Music Generation Service: Produces custom background music tracks
- Video Generation Service: Creates video content from generated images
- Content Pipeline: Orchestrates the flow between services and manages the overall process
- Output Formatting: Structures and saves the generated content in an editor-friendly format
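Below is a minimal sketch of that orchestration, modeled on `src/workflows/content-pipeline.js`. The service exports and signatures are assumptions made for illustration, not the project's actual API:

```javascript
// Minimal sketch of the content pipeline; the exported function names and
// signatures below are assumptions -- check the service modules for the real API.
const { generateScript } = require('../services/llm-service');
const { synthesizeVoice } = require('../services/voice-gen-service');
const { generateImage } = require('../services/image-gen-service');
const { generateMusic } = require('../services/music-gen-service');
const { generateVideo } = require('../services/video-gen-service');

async function runPipeline(prompt, parameters) {
  // 1. Generate the structured script (scenes, narration, scene descriptions).
  const script = await generateScript(prompt, parameters);

  // 2. Per scene: narration audio, then an image, then a video from that image.
  for (const scene of script.scenes) {
    scene.voicePath = await synthesizeVoice(scene.narration);
    scene.imagePath = await generateImage(scene.description);
    scene.videoPath = await generateVideo(scene.imagePath);
  }

  // 3. One background music track per prompt.
  script.musicPath = await generateMusic(script.musicPrompt);

  return script; // The caller writes everything under data/output/.
}

module.exports = { runPipeline };
```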
## API Integrations

- LLM: Leverages OpenAI's GPT models for advanced script generation
- Voice Generation: Integrates with Elevenlabs for high-quality voice synthesis
- Image Generation: Utilizes Midjourney's API for creating visual content
- Music Generation: Uses Suno AI for custom background music creation
- Video Generation: Employs Immersity AI for generating videos from images
Detailed documentation for each service integration can be found in the respective files within the `src/services/` directory.
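To give a sense of what one of these integrations involves, here is a rough, standalone example of calling the Elevenlabs text-to-speech REST endpoint directly. The project's actual wrapper is `src/services/voice-gen-service.js` and may differ; the voice ID, model, and voice settings below are placeholder values:

```javascript
// Illustration only: a direct call to the Elevenlabs text-to-speech endpoint.
// Placeholder voiceId/model_id/settings; requires Node 18+ for global fetch.
const fs = require('fs');

async function textToSpeech(text, voiceId, apiKey, outPath) {
  const res = await fetch(`https://api.elevenlabs.io/v1/text-to-speech/${voiceId}`, {
    method: 'POST',
    headers: {
      'xi-api-key': apiKey,
      'Content-Type': 'application/json',
      Accept: 'audio/mpeg',
    },
    body: JSON.stringify({
      text,
      model_id: 'eleven_multilingual_v2',
      voice_settings: { stability: 0.5, similarity_boost: 0.5 },
    }),
  });
  if (!res.ok) throw new Error(`Elevenlabs request failed: ${res.status}`);
  fs.writeFileSync(outPath, Buffer.from(await res.arrayBuffer()));
}
```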
## Output Format

The generated content is structured as follows for each video:
```
output/
└── YYYY-MM-DD_HH-MM-SS/
    ├── prompt_1/
    │   ├── llm_output.json
    │   ├── background_music.mp3
    │   ├── project_metadata.json
    │   └── scene_1/
    │       ├── voice.mp3
    │       ├── image.png
    │       ├── video.mp4
    │       └── metadata.json
    ├── prompt_2/
    │   ├── llm_output.json
    │   ├── background_music.mp3
    │   ├── project_metadata.json
    │   └── scene_1/
    │       ├── voice.mp3
    │       ├── image.png
    │       ├── video.mp4
    │       └── metadata.json
    └── ...
```
This structure is optimized for seamless import into video editing software, allowing for efficient post-processing and finalization.
## Testing

The project includes separate test files for the LLM, voice generation, image generation, music generation, and video generation services, as well as an integration test. The commands below map to `package.json` scripts (a sketch of those entries follows the list):
- To run all tests: `npm test`
- To run only the LLM test: `npm run test:llm`
- To run only the voice generation test: `npm run test:voice`
- To run only the image generation test: `npm run test:image`
- To run only the music generation test: `npm run test:music`
- To run only the video generation test: `npm run test:video`
- To run the integration test: `npm run test:integration`
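Assuming a conventional layout, the `scripts` section of `package.json` consistent with the commands above might look like this (illustrative; the real file may differ):

```jsonc
// package.json (excerpt) -- illustrative; the actual scripts may differ
{
  "scripts": {
    "start": "node src/index.js",
    "test": "npm run test:llm && npm run test:voice && npm run test:image && npm run test:music && npm run test:video && npm run test:integration",
    "test:llm": "node tests/llm-test.js",
    "test:voice": "node tests/voice-gen-test.js",
    "test:image": "node tests/image-gen-test.js",
    "test:music": "node tests/music-gen-test.js",
    "test:video": "node tests/video-gen-test.js",
    "test:integration": "node tests/integration-test.js"
  }
}
```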
Test outputs are stored in the `tests/test_output/` directory. The integration test processes all scenes for each prompt in the input CSV, exercising all components of the pipeline.
## Troubleshooting

- Review the `logs/app.log` file for detailed error messages and execution logs
- Ensure all API keys and authentication details are correctly set in the `config/default.json` file
- Verify that the input CSV, parameters JSON, and initial prompt TXT files are correctly formatted and located in the `data/input/` directory
- Check that all required npm packages are installed
- For Midjourney-specific issues, ensure your Discord bot has the necessary permissions and that the server and channel IDs are correct
- For Suno-specific issues, ensure your cookie and session ID are up to date and valid
- For Immersity AI-specific issues, verify that the client ID and client secret are correct
- If the integration test fails, run the individual component tests to isolate the issue
## Contributing

Contributions to SHORT-VIDEO-CREATOR-SIMPLIFIED are welcome! Please follow these steps:
- Fork the repository
- Create a new branch for your feature or bug fix
- Commit your changes with clear, descriptive messages
- Push the branch to your fork
- Submit a pull request with a comprehensive description of your changes
For major changes, please open an issue first to discuss what you would like to change.
## License

This project is licensed under the MIT License - see the LICENSE file for details.