Skip to main content

OSI officially releases its definition for Open Source AI

The Open Source Initiative (OSI) today released its open source AI definition version 1.0 to clarify what constitutes open source AI. This gives the industry a standard by which to validate whether or not an AI system can be deemed Open Source AI. 

The definition covers code, model, and data information, with the latter being a contentious point due to legal and practical concerns. Mozilla, a long-time open source advocate, is partnering with OSI to promote openness in AI, advocating for transparency in AI systems.

The need to understand how AI systems work, so they can be researched, scrutinized and potentially regulated, is important to ensure the system is truly open source. Ayah Bdeir, senior strategic advisor on AI strategy at Mozilla, told SD Times on the “What the Dev?” podcast that AI systems are influenced by a number of different components – algorithms, code, hardware, data sets and more. 

As an example, she cited that there are data sets to train models, data sets to test, and data sets to fine tune, and this false sense of transparency leads organizations to claim their systems are open source. “When it comes to AI in traditional open source software, there’s a very clear separation between code that is written, a compiler that is used, and a license that is possessed. Each one of them can have an open license or a closed license and it’s very clear how each one of them applies to this concept of openness.” 

However, in AI systems, many components influence the system, Bdeir said. “There are algorithms, there’s code, there’s hardware, there are data sets. There’s a data set to train, there’s a data set to test, there’s a data set to fine tune, and sort of this idea that if the code is open, that means their AI systems are open, which is not accurate.” This does not allow the fundamental reuse or study of the system that is required under an open source mentality, which is the actual four freedoms – use, study, modify and share, she explained.

“The open source AI definition by OSI is an attempt to put a real fine point on what open source AI is and isn’t, and how to have a checklist that checks for whether something is or isn’t, so that this ambiguity between claiming that something is open source or actually doing it is not is not there anymore,” she said. 

The debate over data information was among the most controversial in coming up with the definition, Bdeir said.  How do organizations that are training their models with proprietary data protect it from being used in open source AI? Bdeir explained there are schools of thought around data in particular. In one school of thought, the data set must be made completely open and available in its exact form for this AI system to be considered open source. “Otherwise,” she said, “you cannot replicate this AI system. You cannot look at the data itself to see what it was trained on, or what it was fine tuned on, etc. And therefore it’s not really open source.”

In another school of thought, where she said some of the more hands-on builders reside, making the data available is not realistic. “Data is governed by laws that are different in different countries. Copyright laws are different in different countries, and licenses on data are not always super clear and easy to find, and if you inadvertently or mistakenly distribute data sets that you have no rights to, you are liable legally.”

The OSI solution to this problem is to talk about data information. What OSI is requiring is data information, not the data in a data set. The wording, Bdeir said, says the organization must provide “sufficiently detailed information about the data used to train the system so that a skilled person can recreate a substantially equivalent system using the same or similar data.”

The post OSI officially releases its definition for Open Source AI appeared first on SD Times.



from SD Times https://ift.tt/Oi0zdXr

Comments

Popular posts from this blog

Difference between Web Designer and Web Developer Neeraj Mishra The Crazy Programmer

Have you ever wondered about the distinctions between web developers’ and web designers’ duties and obligations? You’re not alone! Many people have trouble distinguishing between these two. Although they collaborate to publish new websites on the internet, web developers and web designers play very different roles. To put these job possibilities into perspective, consider the construction of a house. To create a vision for the house, including the visual components, the space planning and layout, the materials, and the overall appearance and sense of the space, you need an architect. That said, to translate an idea into a building, you need construction professionals to take those architectural drawings and put them into practice. Image Source In a similar vein, web development and design work together to create websites. Let’s examine the major responsibilities and distinctions between web developers and web designers. Let’s get going, shall we? What Does a Web Designer Do?

A guide to data integration tools

CData Software is a leader in data access and connectivity solutions. It specializes in the development of data drivers and data access technologies for real-time access to online or on-premise applications, databases and web APIs. The company is focused on bringing data connectivity capabilities natively into tools organizations already use. It also features ETL/ELT solutions, enterprise connectors, and data visualization. Matillion ’s data transformation software empowers customers to extract data from a wide number of sources, load it into their chosen cloud data warehouse (CDW) and transform that data from its siloed source state, into analytics-ready insights – prepared for advanced analytics, machine learning, and artificial intelligence use cases. Only Matillion is purpose-built for Snowflake, Amazon Redshift, Google BigQuery, and Microsoft Azure, enabling businesses to achieve new levels of simplicity, speed, scale, and savings. Trusted by companies of all sizes to meet

2022: The year of hybrid work

Remote work was once considered a luxury to many, but in 2020, it became a necessity for a large portion of the workforce, as the scary and unknown COVID-19 virus sickened and even took the lives of so many people around the world.  Some workers were able to thrive in a remote setting, while others felt isolated and struggled to keep up a balance between their work and home lives. Last year saw the availability of life-saving vaccines, so companies were able to start having the conversation about what to do next. Should they keep everyone remote? Should they go back to working in the office full time? Or should they do something in between? Enter hybrid work, which offers a mix of the two. A Fall 2021 study conducted by Google revealed that over 75% of survey respondents expect hybrid work to become a standard practice within their organization within the next three years.  Thus, two years after the world abruptly shifted to widespread adoption of remote work, we are declaring 20