Earth-News

Ferret: An End-to-End MLLM by Apple

December 23, 2023

An End-to-End MLLM that Accepts Any-Form Referring and Grounds Anything in Response. [Paper]

Haoxuan You*, Haotian Zhang*, Zhe Gan, Xianzhi Du, Bowen Zhang, Zirui Wang, Liangliang Cao, Shih-Fu Chang, Yinfei Yang
[*: equal contribution]

Overview


Diagram of Ferret Model.

Key Contributions:

Ferret Model – Hybrid Region Representation + Spatial-aware Visual Sampler enable fine-grained and open-vocabulary referring and grounding in MLLM.
GRIT Dataset (~1.1M) – A Large-scale, Hierarchical, Robust ground-and-refer instruction tuning dataset.
Ferret-Bench – A multimodal evaluation benchmark that jointly requires Referring/Grounding, Semantics, Knowledge, and Reasoning.

Release

[12/14] 🔥 We released the checkpoints (7B, 13B).
[10/30] 🔥 We released the code of FERRET model and Ferret-Bench.

Usage and License Notices: The data and code are intended and licensed for research use only. They are also restricted to uses that follow the license agreements of LLaMA, Vicuna and GPT-4. The dataset is CC BY NC 4.0 (allowing only non-commercial use), and models trained using the dataset should not be used outside of research purposes.

Contents

Install
Train
Evaluation
Demo

Install

Clone this repository and navigate to FERRET folder

git clone https://github.com/apple/ml-ferret
cd ml-ferret

Install Package

conda create -n ferret python=3.10 -y
conda activate ferret
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
pip install pycocotools
pip install protobuf==3.20.0

Install additional packages for training cases

pip install ninja
pip install flash-attn --no-build-isolation

Train

FERRET is trained on 8 A100 GPUs with 80GB memory. To train on fewer GPUs, you can reduce the per_device_train_batch_size and increase the gradient_accumulation_steps accordingly. Always keep the global batch size the same: per_device_train_batch_size x gradient_accumulation_steps x num_gpus.
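The batch-size rule above is simple arithmetic, and a small helper can make it harder to get wrong. The following is a minimal sketch, not part of the FERRET codebase: the function name is hypothetical, and the per-device batch sizes in the usage lines are illustrative assumptions. Only the global batch size of 128 comes from the Hyperparameters section.

```python
def gradient_accumulation_steps(global_batch_size: int,
                                per_device_batch_size: int,
                                num_gpus: int) -> int:
    """Return the accumulation steps that keep the global batch size fixed.

    global = per_device_train_batch_size x gradient_accumulation_steps x num_gpus
    """
    samples_per_step = per_device_batch_size * num_gpus
    if samples_per_step <= 0:
        raise ValueError("per-device batch size and GPU count must be positive")
    if global_batch_size % samples_per_step != 0:
        raise ValueError(
            f"global batch size {global_batch_size} is not divisible by "
            f"{per_device_batch_size} x {num_gpus}"
        )
    return global_batch_size // samples_per_step


# Illustrative settings (assumed, not taken from the training scripts):
steps_8_gpus = gradient_accumulation_steps(128, per_device_batch_size=16, num_gpus=8)
steps_2_gpus = gradient_accumulation_steps(128, per_device_batch_size=8, num_gpus=2)
```

With these assumed values, 8 GPUs need no accumulation, while 2 GPUs at a per-device batch size of 8 need 8 accumulation steps to reach the same global batch size.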

Hyperparameters

For finetuning, we use a set of hyperparameters similar to that of LLaVA (Vicuna).

Hyperparameter | Global Batch Size | Learning rate | Epochs | Max length | Weight decay
FERRET-7B      | 128               | 2e-5          | 3      | 2048       | 0
FERRET-13B     | 128               | 2e-5          | 3      | 2048       | 0

Prepare Vicuna checkpoint and LLaVA’s projector

Before you start, prepare our base model Vicuna, which is an instruction-tuned chatbot. Please download its weights following the instructions here. Vicuna v1.3 is used in FERRET.

Then download LLaVA’s first-stage pre-trained projector weight (7B, 13B).

FERRET Training

The scripts are provided (7B, 13B).

Evaluation

Please see this doc for the details.

Checkpoints

We extracted the delta between our pre-trained model and Vicuna. Please first download the Vicuna weights following the previous instructions. Then download our prepared weight offsets (7B, 13B) using wget or curl, and unzip them. Lastly, apply the offsets to Vicuna's weights by running the following script:

# 7B
python3 -m ferret.model.apply_delta \
    --base ./model/vicuna-7b-v1-3 \
    --target ./model/ferret-7b-v1-3 \
    --delta path/to/ferret-7b-delta
# 13B
python3 -m ferret.model.apply_delta \
    --base ./model/vicuna-13b-v1-3 \
    --target ./model/ferret-13b-v1-3 \
    --delta path/to/ferret-13b-delta
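To illustrate the idea behind applying a weight offset, here is a minimal sketch in plain Python. It is not the implementation of ferret.model.apply_delta: the function below is hypothetical, weights are modelled as lists of floats rather than tensors, and copying parameters that appear only in the delta is an assumption made for this example.

```python
from typing import Dict, List

Weights = Dict[str, List[float]]


def apply_delta_sketch(base: Weights, delta: Weights) -> Weights:
    """Reconstruct target weights as base + delta, parameter by parameter."""
    target: Weights = {}
    for name, delta_values in delta.items():
        if name not in base:
            # Assumption: a parameter absent from the base model is stored
            # in full in the delta, so it is copied unchanged.
            target[name] = list(delta_values)
            continue
        base_values = base[name]
        if len(base_values) != len(delta_values):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        target[name] = [b + d for b, d in zip(base_values, delta_values)]
    return target


# Toy weights for illustration only.
base_weights = {"layer.weight": [1.0, 2.0]}
delta_weights = {"layer.weight": [0.5, -1.0], "extra.weight": [3.0]}
target_weights = apply_delta_sketch(base_weights, delta_weights)
```

Distributing only the difference means the base weights never need to be redistributed; anyone who already holds them under their own license can rebuild the target locally.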

Notices: Apple’s rights in the attached weight differentials are hereby licensed under the CC-BY-NC license. Apple makes no representations with regards to LLaMa or any other third party software, which are subject to their own terms.

Please refer to the next section for how to set up a local demo with the pre-trained weights.

Demo

To run our demo, you need to train FERRET and use the checkpoints locally. Gradio web UI is used. Please run the following commands one by one.

Launch a controller
python -m ferret.serve.controller --host 0.0.0.0 --port 10000
Launch a Gradio web server
python -m ferret.serve.gradio_web_server --controller http://localhost:10000 --model-list-mode reload --add_region_feature
Launch a model worker

This is the worker that loads the checkpoint and runs inference on the GPU. Each worker is responsible for a single model, specified by --model-path.

CUDA_VISIBLE_DEVICES=0 python -m ferret.serve.model_worker --host 0.0.0.0 --controller http://localhost:10000 --port 40000 --worker http://localhost:40000 --model-path ./checkpoints/FERRET-13B-v0 --add_region_feature

Wait until the process finishes loading the model and you see “Uvicorn running on …”. Now, refresh your Gradio web UI, and you will see the model you just launched in the model list.
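The three launch commands share the controller address and must be started in order, so it can help to assemble them in one place. The sketch below only builds the argument lists from the flags shown above and does not start any process; the function name and its parameters are hypothetical, and CUDA_VISIBLE_DEVICES would still need to be set in the environment of the worker process.

```python
from typing import List

CONTROLLER_URL = "http://localhost:10000"


def demo_commands(model_path: str, worker_port: int = 40000) -> List[List[str]]:
    """Build the controller, web server, and worker commands in launch order."""
    controller = ["python", "-m", "ferret.serve.controller",
                  "--host", "0.0.0.0", "--port", "10000"]
    web_server = ["python", "-m", "ferret.serve.gradio_web_server",
                  "--controller", CONTROLLER_URL,
                  "--model-list-mode", "reload",
                  "--add_region_feature"]
    worker = ["python", "-m", "ferret.serve.model_worker",
              "--host", "0.0.0.0",
              "--controller", CONTROLLER_URL,
              "--port", str(worker_port),
              "--worker", f"http://localhost:{worker_port}",
              "--model-path", model_path,
              "--add_region_feature"]
    return [controller, web_server, worker]


commands = demo_commands("./checkpoints/FERRET-13B-v0")
for command in commands:
    print(" ".join(command))
```

Each list could be passed to subprocess.Popen in a separate terminal or process; that step is left out here because the worker should only be considered ready once it reports that Uvicorn is running.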


Example of Ferret Interactive Demo.

Citation

If you find Ferret useful, please cite using this BibTeX:

@article{you2023ferret,
title={Ferret: Refer and Ground Anything Anywhere at Any Granularity},
author={You, Haoxuan and Zhang, Haotian and Gan, Zhe and Du, Xianzhi and Zhang, Bowen and Wang, Zirui and Cao, Liangliang and Chang, Shih-Fu and Yang, Yinfei},
journal={arXiv preprint arXiv:2310.07704},
year={2023}
}

Acknowledgement

LLaVA: the codebase we built upon.
Vicuna: the LLM codebase.

Copyright for syndicated content belongs to the linked Source : Hacker News – https://github.com/apple/ml-ferret
