Merge pull request #30402 from wxy/20231101-Open-Source-Definition-for-AI-Models-Need-a-Change

ATRP:published/20231101 Open Source Definition for AI Models Need a Change.md
This commit is contained in:
Xingyu.Wang 2023-11-02 22:04:33 +08:00 committed by GitHub
commit 5e06f665b6
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
2 changed files with 99 additions and 96 deletions

View File

@ -0,0 +1,99 @@
[#]: subject: "Open Source Definition for AI Models Need a Change"
[#]: via: "https://news.itsfoss.com/open-source-definition-ai/"
[#]: author: "Sourav Rudra https://news.itsfoss.com/author/sourav/"
[#]: collector: "lujun9972/lctt-scripts-1693450080"
[#]: translator: "ChatGPT"
[#]: reviewer: "wxy"
[#]: publisher: "wxy"
[#]: url: "https://linux.cn/article-16344-1.html"
AI 模型的开源定义需要改变
======
![][0]
> 你认为开源许可证应当进行演变吗?
2023 年,我们以**人工智能AI崭露头角**开始了新的一年,同时也见证了众多公司全力以赴投身于 AI。
比如说 **Mozilla**,它在 2023 年初制定了 [开源 AI 计划][1],以开发各种 AI 驱动的解决方案。而 **HuggingChat** 也成为了第一个推出 ChatGPT [开源替代品][2] 的组织。
即便是 Meta他们也不例外。他们自家的 <ruby>大型语言模型<rt>Large Language Model</rt></ruby>LLM[Llama 2][3] 项目在这一年都颇受关注,几个月前他们甚至推出了一款新的 [ChatGPT 竞争对手][4]。
然而,也有很多人开始 [提出质疑][5],主张 **Meta 的 Llama 2 模型并不像人们期望的那样开放**,查看它的开源许可证似乎更是印证了这个观点。
该许可证 **不允许拥有超过 7 亿日活跃用户的服务使用 Llama 2**,同样的,**它也不能被用于训练其他的语言模型**。
这也就意味着 Meta 对于 Llama 2 的许可证 **未能满足** <ruby>开源倡议组织<rt>Open Source Initiative</rt></ruby>OSI<ruby>[开源定义][6]<rt>Open Source Definition</rt></ruby>OSD所列出的 **全部要求**
人们可以争辩,像 [EleutherAI][7] 和 [Falcon 40B][8] 这样的机构就做出了很好的示范,展示了如何适当地处理 AI 的开源许可。
然而Meta 对此的看法却截然不同。
### 开源许可需要进化
在与 [The Verge][10] 的交谈中Meta 人工智能研究副总裁 [Joëlle Pineau][11] 为他们的立场进行了辩解。
她说,我们 **需要在信息共享的益处和可能对 Meta 商业造成的潜在成本之间寻找平衡**
这种对开源的态度让他们的研究人员能够更加专注地处理 AI 项目。她还补充说:
> 开放的方式从内部改变了我们的科研方法,它促使我们不发布任何不安全的东西,并在一开始就负起责任。
Joëlle 希望他们的生成型 AI 模型能够和他们过去的 [PyTorch][12] 项目一样受到热捧。
但是,**问题在于现有的许可证机制**。她又补充说,这些许可证并不是设计来处理那些需要利用大量多源数据的软件。
这反过来**为开发者和用户提供了有限责任**,以及,**对版权侵犯的有限赔偿**(解释为:保护)。
此外,她还指出:
> AI 模型与软件不同,涉及的风险更大,因此我认为我们应该对当前用户许可证进行改变,以更好地适应 AI 模型。
>
> 但我并不是一名律师,所以我在此问题上听从他们的意见。
我赞同她的观点,我们需要更新现有的许可方案,使之更好地适应 AI 模型,以及其他相关事务。
显而易见,**OSI 正在努力进行此事**。OSI 的执行董事 [Stefano Maffulli][13] 向 The Verge 透露,他们了解到 **当前的 OSI 批准的许可证无法满足人工智能模型的需求**
他们正在商讨如何与 AI 开发者合作,以提供一个 “**透明、无许可但安全**” 的模型访问。
他还补充说:
> 我们肯定需要重新思考许可证的方式,以解决 AI 模型中版权和授权的真正限制,同时仍遵循开源社区的一些原则。
无论未来如何,显然,**开源标准必须推动其演化,以适应新的以及即将出现的技术** ,而此类问题不仅仅局限于 AI。
对于未来几年开源许可的变革,我充满期待。
💬 对于你来说呢?你认为对于陈旧的开源标准,我们需要进行什么样的改变?
*题图MJ/e8bae5f6-606b-47db-aaea-c992c0bd143e*
--------------------------------------------------------------------------------
via: https://news.itsfoss.com/open-source-definition-ai/
作者:[Sourav Rudra][a]
选题:[lujun9972][b]
译者:[ChatGPT](https://linux.cn/lctt/ChatGPT)
校对:[wxy](https://github.com/wxy)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]: https://news.itsfoss.com/author/sourav/
[b]: https://github.com/lujun9972
[1]: https://news.itsfoss.com/mozilla-open-source-ai/
[2]: https://news.itsfoss.com/huggingchat-chatgpt/
[3]: https://ai.meta.com/llama/
[4]: https://news.itsfoss.com/meta-open-source-chatgpt/
[5]: https://www.wired.com/story/the-myth-of-open-source-ai/
[6]: https://opensource.org/osd/
[7]: https://www.eleuther.ai/
[8]: https://www.tii.ae/news/uaes-technology-innovation-institute-launches-open-source-falcon-40b-large-language-model
[9]: https://news.itsfoss.com/content/images/2023/04/Follow-us-on-Google-News.png
[10]: https://www.theverge.com/2023/10/30/23935587/meta-generative-ai-models-open-source
[11]: https://en.wikipedia.org/wiki/Jo%C3%ABlle_Pineau
[12]: https://pytorch.org/
[13]: https://twitter.com/smaffulli
[0]: https://img.linux.net.cn/data/attachment/album/202311/02/215953yyz45l5l3v4fzqyv.jpg

View File

@ -1,96 +0,0 @@
[#]: subject: "Open Source Definition for AI Models Need a Change"
[#]: via: "https://news.itsfoss.com/open-source-definition-ai/"
[#]: author: "Sourav Rudra https://news.itsfoss.com/author/sourav/"
[#]: collector: "lujun9972/lctt-scripts-1693450080"
[#]: translator: " "
[#]: reviewer: " "
[#]: publisher: " "
[#]: url: " "
Open Source Definition for AI Models Need a Change
======
Do you think that the open-source licenses should evolve?
We started the year 2023 with **Artificial Intelligence (AI) being one of the biggest catches** , we also saw many companies going all in into this.
Take for instance **Mozilla** , who entered 2023 with [plans to work on open-source AI][1] to develop various types of AI-powered solutions, or even **HuggingChat** , the [first open source alternative to ChatGPT][2].
Even Meta is no stranger to that, they have their [**Llama 2**][3] large language model (LLM) that has been making waves all year round, they even announced a **[**ChatGPT contender**][4]** a few months back.
However, many have [raised questions][5] that **Meta's Llama 2 is not as open as one would expect** , and that does seem the case when one looks at its open-source license.
The license **doesn't allow using Llama 2 for services that have over 700 million daily users** , and it also **cannot be used to train other language models**.
This also means that Meta's license for **Llama 2 doesn't meet all the requirements of the Open Source Initiative's (OSI)[Open Source Definition][6]**.
One can argue that open-source licenses implemented by the likes of [EleutherAI][7] and [Falcon 40B][8] are a good example of how open source licensing should be handled for AI.
But, Meta has a different take on it.
![][9]
### Open Source Licensing Needs to Evolve
In conversation with [The Verge][10], the VP for AI research over at Meta, [**Joëlle Pineau**][11] defended their stand.
She says that there is **a need to balance between the benefits of information sharing** and the **potential costs that could be incurred to Meta's business**.
This approach to open source has allowed their researchers to take a more focused approach in handling their AI projects. She also adds:
> Being open has internally changed how we approach research, and it drives us not to release anything that isnt very safe and be responsible at the onset.
Joëlle also hopes that they can have the same level of enthusiasm with their generative AI models that they have seen in the past with their [**PyTorch**][12] initiative,
But, the **problem lies with current licensing schemes**. She adds that, these licenses were not meant to work with software that leverage large amounts of data from a multitude of sources.
This in turn gives **limited liability to both the users and the developers** , with **limited indemnity to copyright infringement** (read as: protection against).
Furthermore, she added that:
> AI models are different from software because there are more risks involved, so I think we should evolve the current user licenses we have to fit AI models better.
> But Im not a lawyer, so I defer to them on this point.
I do agree with her on that, there is a need to update current licenses to better fit AI models, among other things.
And it appears that **the OSI are already on the job**. [**Stefano Maffulli**][13], the executive director of OSI, has told The Verge that they understand that the **current OSI-approved licenses are not up to the job for AI models**.
They are in the process of reviewing how to work with AI developers to provide a “ **transparent, permissionless, yet safe** ” access to models.
He also added that:
> We definitely have to rethink licenses in a way that addresses the real limitations of copyright and permissions in AI models while keeping many of the tenets of the open source community.
Regardless of what happens, it is clear that **open-source standards will have to evolve to accommodate new and emerging tech** , while not being just limited to AI.
I am looking forward to how the open-source licensing changes in the coming years.
_💬 What about you? Do you think that older open-source standards need to be updated?_
* * *
--------------------------------------------------------------------------------
via: https://news.itsfoss.com/open-source-definition-ai/
作者:[Sourav Rudra][a]
选题:[lujun9972][b]
译者:[译者ID](https://github.com/译者ID)
校对:[校对者ID](https://github.com/校对者ID)
本文由 [LCTT](https://github.com/LCTT/TranslateProject) 原创编译,[Linux中国](https://linux.cn/) 荣誉推出
[a]: https://news.itsfoss.com/author/sourav/
[b]: https://github.com/lujun9972
[1]: https://news.itsfoss.com/mozilla-open-source-ai/
[2]: https://news.itsfoss.com/huggingchat-chatgpt/
[3]: https://ai.meta.com/llama/
[4]: https://news.itsfoss.com/meta-open-source-chatgpt/
[5]: https://www.wired.com/story/the-myth-of-open-source-ai/
[6]: https://opensource.org/osd/
[7]: https://www.eleuther.ai/
[8]: https://www.tii.ae/news/uaes-technology-innovation-institute-launches-open-source-falcon-40b-large-language-model
[9]: https://news.itsfoss.com/content/images/2023/04/Follow-us-on-Google-News.png
[10]: https://www.theverge.com/2023/10/30/23935587/meta-generative-ai-models-open-source
[11]: https://en.wikipedia.org/wiki/Jo%C3%ABlle_Pineau
[12]: https://pytorch.org/
[13]: https://twitter.com/smaffulli