Showing 65 open source projects for "visual object net"

  • 1
    Label Studio

    Label Studio is a multi-type data labeling and annotation tool

    The most flexible data annotation tool. Quickly installable. Build custom UIs or use pre-built labeling templates. Detect objects in images, with bounding boxes, polygons, circles, and keypoints supported. Partition images into multiple segments. Use ML models to pre-label and optimize the process. Label Studio is an open-source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats. It can...
    Downloads: 28 This Week
  • 2
    LatentSync

    Taming Stable Diffusion for Lip Sync

    ...In effect, given a source video (with masked or reference frames) and an audio track, LatentSync directly generates frames whose lip motions and expressions align with the audio, producing convincing talking-head or animated lip-sync output. The system leverages a U-Net diffusion backbone, with cross-attention of audio embeddings (via an audio encoder) and reference video frames to guide generation, and applies a set of loss functions (temporal, perceptual, sync-net based) to enforce lip-sync accuracy, visual fidelity, and temporal consistency. Over versions, LatentSync has improved temporal stability and lowered resource requirements — making inference more practical (e.g. 8 GB VRAM for earlier versions, somewhat higher for latest models).
    Downloads: 6 This Week
  • 3
    Sa2VA

    Official repo for "Sa2VA: Marrying SAM2 with LLaVA"

    Sa2VA is a cutting-edge open-source multi-modal large language model (MLLM) developed by ByteDance that unifies dense segmentation, visual understanding, and language-based reasoning across both images and videos. It merges the segmentation power of a state-of-the-art video segmentation model (based on SAM‑2) with the vision-language reasoning capabilities of a strong LLM backbone (derived from models like InternVL2.5 / Qwen-VL series), yielding a system that can answer questions about visual content, perform referring segmentation, and maintain temporal consistency across frames in video. ...
    Downloads: 0 This Week
  • 4
    MoCo (Momentum Contrast)

    Self-supervised visual learning using momentum contrast in PyTorch

    MoCo is an open source PyTorch implementation developed by Facebook AI Research (FAIR) for the papers “Momentum Contrast for Unsupervised Visual Representation Learning” (He et al., 2019) and “Improved Baselines with Momentum Contrastive Learning” (Chen et al., 2020). It introduces Momentum Contrast (MoCo), a scalable approach to self-supervised learning that enables visual representation learning without labeled data. The core idea of MoCo is to maintain a dynamic dictionary with a...
    Downloads: 0 This Week
  • 5
    HunyuanWorld 1.0

    Generating Immersive, Explorable, and Interactive 3D Worlds

    ...It combines the strengths of video-based diversity and 3D-based geometric consistency through a novel framework using panoramic world proxies and semantically layered 3D mesh representations. This approach enables 360° immersive experiences, seamless mesh export for graphics pipelines, and disentangled object representations for enhanced interactivity. The architecture integrates panoramic proxy generation, semantic layering, and hierarchical 3D reconstruction to produce high-quality scene-scale 3D worlds from both text and images. HunyuanWorld-1.0 surpasses existing open-source methods in visual quality and geometric consistency, demonstrated by superior scores in BRISQUE, NIQE, Q-Align, and CLIP metrics.
    Downloads: 5 This Week
  • 6

    sidmon5.net

    Sudden ionospheric disturbance monitor with Stokes data product

    This package is a VLF receiver for monitoring VLF transmitter signals for evidence of transients indicating ionospheric disturbances, usually caused by x-ray bursts from the sun. It takes sample pairs from dual-channel sound cards and spectrally processes them to Stokes parameters. Data are plotted as time series and in scatter plots.
    Downloads: 0 This Week
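The sidmon5.net description mentions processing dual-channel sample pairs into Stokes parameters. The following is only a rough sketch of that idea, not the package's actual code: it takes one frequency bin from each channel with a naive DFT and combines the two complex amplitudes. The function names `dft_bin` and `stokes` are hypothetical, and the sign of V depends on the polarization convention in use (assumed here).

```python
import cmath
import math


def dft_bin(samples, k):
    """Complex amplitude of frequency bin k (naive DFT, adequate for a sketch)."""
    n = len(samples)
    return sum(
        x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(samples)
    ) / n


def stokes(ex, ey):
    """Stokes parameters (I, Q, U, V) from two complex channel amplitudes.

    Assumed convention: V = -2 * Im(ex * conj(ey)); other conventions flip the sign.
    """
    cross = ex * ey.conjugate()
    i = abs(ex) ** 2 + abs(ey) ** 2
    q = abs(ex) ** 2 - abs(ey) ** 2
    u = 2 * cross.real
    v = -2 * cross.imag
    return i, q, u, v


# Two channels carrying the same tone 90 degrees apart (a circularly polarized signal).
n, k = 64, 4
left = [math.cos(2 * math.pi * k * t / n) for t in range(n)]
right = [math.sin(2 * math.pi * k * t / n) for t in range(n)]
I, Q, U, V = stokes(dft_bin(left, k), dft_bin(right, k))
```

For a fully polarized signal such as this one, I squared equals the sum of the squares of Q, U, and V, which makes a convenient sanity check.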
  • 7
    Django Notebook

    Django + shell_plus + Jupyter notebooks made easy

    Django + shell_plus + Jupyter notebooks made easy. A Jupyter notebook with access to objects from the Django ORM is a powerful tool to introspect data and run ad-hoc queries. Built-in integration with the imported objects from django-extensions shell_plus. Saves the state between sessions so you don't need to remember what you did. Inheritance diagrams on any object, including ORM models.
    Downloads: 0 This Week
  • 8
    Pytorch-toolbelt

    PyTorch extensions for fast R&D prototyping and Kaggle farming

    pytorch-toolbelt is a Python library with a set of bells and whistles for PyTorch for fast R&D prototyping and Kaggle farming. Easy model building using a flexible encoder-decoder architecture. Modules: CoordConv, SCSE, Hypercolumn, depthwise separable convolution, and more. GPU-friendly test-time augmentation (TTA) for segmentation and classification. GPU-friendly inference on huge (5000x5000) images. Everyday common routines (fix/restore random seed, filesystem utils, metrics). Losses:...
    Downloads: 0 This Week
  • 9
    Python Progressbar

    Progressbar 2 - A progress bar for Python 2 and Python 3

    ...The ProgressBar class manages the current progress, and the format of the line is given by a number of widgets. A widget is an object that may display differently depending on the state of the progress bar.
    Downloads: 0 This Week
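The widget idea described for Python Progressbar can be illustrated with a small self-contained sketch. This is not the progressbar2 API: the classes `Percentage`, `Bar`, and `MiniProgressBar` below are hypothetical stand-ins showing how a bar might track progress while each widget renders itself from the current state.

```python
class Percentage:
    """Widget that renders the progress as a right-aligned percentage."""

    def render(self, value, max_value):
        return f"{100 * value // max_value:3d}%"


class Bar:
    """Widget that renders a fixed-width bar of '#' characters."""

    def __init__(self, width=20):
        self.width = width

    def render(self, value, max_value):
        filled = self.width * value // max_value
        return "[" + "#" * filled + " " * (self.width - filled) + "]"


class MiniProgressBar:
    """Tracks the current progress; the line format comes from the widgets."""

    def __init__(self, widgets, max_value):
        if max_value <= 0:
            raise ValueError("max_value must be positive")
        self.widgets = widgets
        self.max_value = max_value
        self.value = 0

    def update(self, value):
        # Clamp so widgets never see a value outside 0..max_value.
        self.value = max(0, min(value, self.max_value))
        return self.line()

    def line(self):
        return " ".join(w.render(self.value, self.max_value) for w in self.widgets)


bar = MiniProgressBar([Percentage(), Bar(width=10)], max_value=50)
halfway = bar.update(25)
```

Each widget sees only the bar's state, so the same widget can display differently as the progress changes, and the line layout is just the order of the widget list.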
  • 10
    DWSIM - Open Source Process Simulator

    Simulate chemical processes using advanced thermodynamic models

    DWSIM is an open source, CAPE-OPEN compliant chemical process simulator for Windows, Linux and macOS systems. Written in VB.NET and C#, DWSIM features a comprehensive set of unit operations, advanced thermodynamic models, support for reacting systems, petroleum characterization tools and a fully-featured graphical interface. DWSIM Pro is a commercial sibling of DWSIM built on top of open-source software. It offers extended features, comes with private support, and is accessible in the...
    Downloads: 1,371 This Week
  • 11

    lms2fits

    Dual-channel spectroscopic receiver using LimeSDR-USB

    `lms2fits` is a dual-channel spectroscopic receiver for radio astronomy that employs the LimeSDR-USB dual-channel transceiver, which in turn employs Lime Microsystems' LMS7002M transceiver chip. These systems allow a frequency-agile (<30 MHz to 3.8 GHz) receiver providing Stokes parameters in dynamic spectra of up to 60 MHz analog bandwidth, streamed to a FITS file with a three-axis primary table. It runs on Linux under .NET.
    Downloads: 0 This Week
  • 12
    SCons

    A software construction tool

    SCons is a software construction tool that is a superior alternative to the classic "Make" build tool that we all know and love. SCons is implemented as a Python script and set of modules, and SCons "configuration files" are actually executed as Python scripts. This gives SCons many powerful capabilities not found in other software build tools. We make SCons available in three distinct packages, for different purposes. - The scons package is the basic package to install SCons. You...
    Downloads: 1,186 This Week
  • 13
    Universal runtime installer

    This installer installs the latest Windows runtimes

    This installer installs the latest Visual C++ runtimes of all years (2008-2022), the full DirectX Runtime, Microsoft XNA Framework, .NET Runtime, Java, and OpenAL at once.
    Downloads: 16 This Week
  • 14
    Wait and Pounce FT8

    Wait to decode callsigns, then call them using UDP protocol

    A neat alternative to JTAlert, available for both Apple and Windows (10+). It automatically calls any wanted callsigns you specify. It can also be used to monitor band openings, can handle callsign exclusions, and can even select wanted callsigns according to your worked-before rules.
    Downloads: 222 This Week
  • 15

    rx2fits

    HF/VHF spectroscopy code for the rx888mk2 direct-sampling receiver

    rx2fits is an SDR code for the rx888mk2 direct-sampling receiver that streams spectral data with frequency up to 65 MHz to FITS files. This code is for the direct-sampling input of the receiver, which transmits real samples over USB3 at up to 130 MHz sample rate. rx2fits processes these samples to spectral intensities via a Fourier-transform poly-phase filter bank, which provides spectral resolution approaching the spectral bin width with good stop-band and adjacent-channel rejection. ...
    Downloads: 0 This Week
  • 16
    Snap7

    32/64 bit multi-platform Ethernet S7 PLC communication suite

    Through three specialized components (Client, plus the novel Server and Partner), Snap7 lets you fully integrate your PC-based systems into a PLC automation chain. Designed to transfer large amounts of high-speed data in industrial facilities, it scales easily down to small Linux ARM boards such as the Raspberry Pi. High-level object-oriented wrappers are provided, currently for C/C++, .NET/Mono, Pascal, LabVIEW, and Python, with many source code examples. It is very easy to use: a full working server example is no bigger than "Hello world". Many projects/makefiles are ready to run, so you can easily rebuild Snap7 on any platform without needing to be a C++ guru. Very detailed documentation is provided.
    Downloads: 679 This Week
  • 17
    Bhakti-Apps
    This app is meant for a simplified user experience, perfect for note takers. It has a good encryption system, and the best feature is that it works on the CLI, so its compatibility is great.
    Downloads: 0 This Week
  • 18
    ufonet

    UFONet - Denial of Service Toolkit

    UFONet is a set of hacktivist tools that allow launching coordinated DDoS and DoS attacks and combining both in a single offensive. It also works as an encrypted DarkNET to publish and receive content by creating a global client/server network based on a direct-connect P2P architecture. FAQ: https://ufonet.03c8.net/FAQ.html -> UFONet-v1.8 [DPh] "DarK-PhAnT0m!" (.zip) -> md5 = [ c8ab016f6370c8391e2e6f9a7cbe990a ] -> UFONet-v1.8...
    Downloads: 10 This Week
  • 19
    Nougat

    Implementation of Nougat Neural Optical Understanding

    Nougat is a multi-modal generative modeling framework that bridges vision and text modalities with structured generation control (e.g. layout, scene composition) rather than treating images as flat contexts. It combines object-centric modules with transformer-based reasoning to propose, refine, and render scenes in a generative pipeline. The architecture allows you to specify or prompt a layout (which objects should be where) and then the model fills in appearance, context, lighting, and...
    Downloads: 0 This Week
  • 20
    SMC - The State Machine Compiler

    Translates state machine into a target programming language.

    SMC takes a state machine stored in a .sm file and generates a State pattern in 14 programming languages. Includes: default transitions, transition args, transition guards, push/pop transitions and Entry/Exit actions. See User Manual for more info.
    Downloads: 14 This Week
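To give a feel for the kind of State pattern the SMC entry describes, here is a hand-written sketch with a default transition, a transition argument, a guard, and Entry/Exit actions. It is not SMC output and does not use the .sm syntax; the turnstile example and every name in it (`Turnstile`, `Locked`, `Unlocked`, `fare`) are illustrative assumptions.

```python
class State:
    """Base state: every event falls through to the default transition."""

    name = "State"

    def entry(self, ctx):
        ctx.log.append(f"enter {self.name}")

    def exit(self, ctx):
        ctx.log.append(f"exit {self.name}")

    def coin(self, ctx, amount):
        self.default(ctx, "coin")

    def push(self, ctx):
        self.default(ctx, "push")

    def default(self, ctx, event):
        ctx.log.append(f"ignored {event} in {self.name}")


class Locked(State):
    name = "Locked"

    def coin(self, ctx, amount):
        # Transition guard: only unlock when the coin covers the fare.
        if amount >= ctx.fare:
            ctx.change_state(UNLOCKED)
        else:
            self.default(ctx, "coin")


class Unlocked(State):
    name = "Unlocked"

    def push(self, ctx):
        ctx.change_state(LOCKED)


LOCKED, UNLOCKED = Locked(), Unlocked()


class Turnstile:
    """Context object: forwards events to the current state."""

    def __init__(self, fare=2):
        self.fare = fare
        self.log = []
        self.state = LOCKED
        self.state.entry(self)

    def change_state(self, new_state):
        self.state.exit(self)   # Exit action of the old state
        self.state = new_state
        self.state.entry(self)  # Entry action of the new state

    def coin(self, amount):
        self.state.coin(self, amount)

    def push(self):
        self.state.push(self)


t = Turnstile(fare=2)
t.coin(1)   # guard fails, default transition
t.push()    # no transition defined in Locked, default transition
t.coin(2)   # guard passes, Locked -> Unlocked
t.push()    # Unlocked -> Locked
```

Keeping the states as shared singleton objects and the mutable data in the context is one common way to lay this pattern out; a generator is free to choose a different layout.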
  • 21
    Downloads: 0 This Week
  • 22
    ELECHEM & FARADAY (Electrochemical)

    Simulation tool for electrochemical / electroplating parameters

    FARADAY and ELECHEM are two portable computer programs that electrochemical researchers, electroplating units, battery and supercapacitor industries, process engineers, materials scientists and engineers, electroplaters, hobbyists, and students may find useful for evaluating various electrochemical deposition parameters. These computational tools can be used to simulate various electrochemical deposition data or electroplating parameters for metals, alloys, composites as well as energy...
    Downloads: 0 This Week
  • 23
    Battery Life Saver

    Battery Life Saver can extend the life span of laptop batteries.

    Battery Life Saver can extend the life span of laptop batteries. Continuously charging a laptop battery above 90% or letting it discharge below 15% reduces its life span. This lightweight .exe program sounds an alarm on excess charging or discharging. Set the desired battery limit percentage using the slider (say, 50%) and let Battery Life Saver sound the alarm. It automatically detects charging and battery status. For more advanced options, contact the...
    Downloads: 2 This Week
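The alarm rule described for Battery Life Saver can be sketched as a simple threshold check. This is an illustration only, not the program's code: the function name `battery_alarm` and its parameters are hypothetical, and the 90% and 15% defaults are taken from the description above.

```python
def battery_alarm(percent, charging, upper=90, lower=15):
    """Return an alarm message, or None when the battery is within limits.

    percent  -- current charge level, 0 to 100
    charging -- True while the charger is connected
    upper    -- alarm when charging above this level (assumed default: 90)
    lower    -- alarm when discharging below this level (assumed default: 15)
    """
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    if charging and percent > upper:
        return f"Battery at {percent}%: unplug the charger"
    if not charging and percent < lower:
        return f"Battery at {percent}%: plug in the charger"
    return None


overcharged = battery_alarm(95, charging=True)
healthy = battery_alarm(50, charging=True)
drained = battery_alarm(10, charging=False)
custom_limit = battery_alarm(60, charging=True, upper=50)
```

Passing a different `upper` value plays the role of the slider mentioned in the description.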
  • 24
    NH Folder Hider and Locker

    NH Folder Hider and Locker 1.0 is a powerful application which provide

    NH Folder Hider and Locker 1.0 is a powerful application that gives you the means to protect your PC files from unauthorized access. Nowadays almost every computer is connected to the Internet, so there is always a risk that your personal data will be accessed. Keeping confidential files on your PC is always risky, so this software offers an extra sense of security. It has a well-organized and very interactive interface which provides you the...
    Downloads: 0 This Week
  • 25
    Surface Defect Detection Dataset Papers

    Constantly summarizing open source dataset and critical papers

    ...Generally speaking, imaging schemes are designed around the different properties of the inspected surface or its defects. A reasonable imaging scheme helps to obtain images that have uniform illumination and clearly show the surface defects of the object. In recent years, many defect detection methods based on deep learning have also been widely used in various industrial scenarios.
    Downloads: 0 This Week