Margaret Mitchell's picture

Margaret Mitchell

meg

·

http://www.m-mitchell.com

AI & ML interests

natural language processing, computer vision, ethical artificial intelligence, assistive and augmentative technology

Recent Activity

replied to their post about 9 hours ago

🤖 ICYMI: Yesterday, Hugging Face and OpenAI partnered to bring open source GPT to the public. This is a Big Deal in "AI world". 0. Common ground setting: OpenAI is the ChatGPT people. An “open source” model is one whose weights are available — that means the model can be “yours”. 1. You don’t have to interact with the company directly, nor give them your interactions, to use the system. The company can't "surveil" you. 2. You can evaluate the unique contributions of their SOTA model much more rigorously than you can when there are collections of models+code behind a closed API. You can find out specifically what the model can and can't do. 3. And you can directly customize it for whatever you'd like. Fine-tuning, wherein you give the model data that's tailored to your use cases and train it some more on that data, is trivial* when you have the model weights. *Provided you have the compute. 4. You can directly benchmark whatever you'd like. Biases? Energy usage? Strengths/weaknesses? Go for it. You wants it you gots it--this transparency helps people understand SOTA *in general*, not just for this model, but points to, e.g., what's going on with closed Google models as well. 5. One of the most powerful things about "openness" that I've learned is that it cultivates ecosystems of collaborators building on top of one another's brilliance to make systems that are significantly better than they would be if created in isolation. But, caveat wrt my own philosophy... 6. I do not take it as a given that advancing LLMs is good, and have a lot more to say wrt where I think innovation should focus more. For example, a focus on *data* -- curation, measurement, consent, credit, compensation, safety -- would deeply improve technology for everyone. 7. The transparency this release provides is massive for people who want to *learn* about LLMs. For the next generation of technologists to advance over the current, they MUST be able to learn about what's happening now. (cont...)

posted an update about 9 hours ago

🤖 ICYMI: Yesterday, Hugging Face and OpenAI partnered to bring open source GPT to the public. This is a Big Deal in "AI world". 0. Common ground setting: OpenAI is the ChatGPT people. An “open source” model is one whose weights are available — that means the model can be “yours”. 1. You don’t have to interact with the company directly, nor give them your interactions, to use the system. The company can't "surveil" you. 2. You can evaluate the unique contributions of their SOTA model much more rigorously than you can when there are collections of models+code behind a closed API. You can find out specifically what the model can and can't do. 3. And you can directly customize it for whatever you'd like. Fine-tuning, wherein you give the model data that's tailored to your use cases and train it some more on that data, is trivial* when you have the model weights. *Provided you have the compute. 4. You can directly benchmark whatever you'd like. Biases? Energy usage? Strengths/weaknesses? Go for it. You wants it you gots it--this transparency helps people understand SOTA *in general*, not just for this model, but points to, e.g., what's going on with closed Google models as well. 5. One of the most powerful things about "openness" that I've learned is that it cultivates ecosystems of collaborators building on top of one another's brilliance to make systems that are significantly better than they would be if created in isolation. But, caveat wrt my own philosophy... 6. I do not take it as a given that advancing LLMs is good, and have a lot more to say wrt where I think innovation should focus more. For example, a focus on *data* -- curation, measurement, consent, credit, compensation, safety -- would deeply improve technology for everyone. 7. The transparency this release provides is massive for people who want to *learn* about LLMs. For the next generation of technologists to advance over the current, they MUST be able to learn about what's happening now. (cont...)

posted an update 7 days ago

🤖 👾 Thanks so much to BBC News and the stellar Suranjana Tewari for having me on to talk about US <—> China relationship in AI, and what it means for AI ethics.

View all activity

Organizations

published an article 7 months ago

Article

AI Agents Are Here. What Now?

By

and 3 others •

Jan 13

• 83

published an article 11 months ago

Article

The Environmental Impacts of AI -- Primer

By

and 2 others •

Sep 3, 2024

• 42

published an article about 1 year ago

Article

Experimenting with Automatic PII Detection on the Hub using Presidio

By

and 3 others •

Jul 10, 2024

• 25

published an article about 1 year ago

Article

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

By

and 9 others •

Jun 24, 2024

• 34

published an article over 1 year ago

Article

Public Policy at Hugging Face

By

and 4 others •

Apr 8, 2024

• 23

published an article over 1 year ago

Article

AI Watermarking 101: Tools and Techniques

By

and 8 others •

Feb 26, 2024

• 20

published an article almost 2 years ago

Article

Ethics and Society Newsletter #5: Hugging Face Goes To Washington and Other Summer 2023 Musings

By

•

Sep 29, 2023

• 1

published an article about 2 years ago

Article

Ethics and Society Newsletter #4: Bias in Text-to-Image Models

By

and 6 others •

Jun 26, 2023

• 2

published an article about 2 years ago

Article

AI Policy @🤗: Response to the U.S. NTIA's Request for Comment on AI Accountability

By

and 2 others •

Jun 20, 2023

• 1

published an article over 2 years ago

Article

Ethics and Society Newsletter #3: Ethical Openness at Hugging Face

By

and 6 others •

Mar 30, 2023

published an article over 2 years ago

Article

Model Cards: Introducing HF Model documentation tools

By

and 2 others •

Dec 20, 2022

published an article almost 3 years ago

Article

Evaluating Language Model Bias with 🤗 Evaluate

By

and 4 others •

Oct 24, 2022

• 5

published an article almost 3 years ago

Article

Ethics and Society Newsletter #1

By

•

Sep 22, 2022

published an article about 3 years ago

Article

Putting ethical principles at the core of research lifecycle

By

and 11 others •

May 19, 2022

published an article over 3 years ago

Article

Introducing the Data Measurements Tool: an Interactive Tool for Looking at Datasets

By

and 2 others •

Nov 29, 2021