Computer Vision @Techolution | UG Researcher | Deep Learning | President @MLSC VIIT Pune | 2X Patent Holder
- 🔭 I am into Deep Learning Architectures and Techniques
- 🤗 Publishing ML models on Hugging Face
- 👨‍💻 All of my projects are available at shreyasdixit.me
- Motto: Lead, Learn, Inspire
- Joined Techolution as a Computer Vision Engineering Intern
- 2nd Patent Published: Second patent in 2023, "Real-Time MultiModal Video Narration Platform for Visually Impaired People"
- Indian Patent Published: Recently published an Indian patent titled "Assistance Platform for Visually Impaired Person using Image Captioning"
Click a project name to go directly to its GitHub repository, and click the demo app link to see a live demo of the project.
- EchoSense: VisualAudioAI is a multimodal model that aims to generate audio descriptions of any given image. [HF Space]
- HingFlow: A translation model that translates English sentences to Hindi, using a Transformer-based Neural Machine Translation approach. [HF Space]
- MaskedLM: A PyTorch implementation of the BART architecture for Masked Language Modeling on English and Hinglish data. The models are available to use on Hugging Face. [HF Model card]
- Neural Translation: A Transformer-based Neural Machine Translation model in TensorFlow for English-to-Hinglish translation. Built from scratch and achieved 94% accuracy within 24 hours for the Neuro Hackathon.
- Image Captioning: A multimodal model for image-to-text generation. Given any image, the model outputs text describing the image.
- Open Source contributions
- I write about productivity in this weekly newsletter "Productivity Pro".
- I use Notion daily as my second brain. I also create Notion templates to help you stay organized and productive.
- I have multiple Notion templates in Notion's official gallery.