Skip to main content

GitHub Copilot has a confidence problem

The progress that has been made in generative AI (GenAI) technology is nothing short of astounding, and yet, not without its flaws. But these flaws aren’t surprising when you consider that these artificial neural networks are modeled after something equally as impressive and flawed: human intelligence. As such, GenAI falls victim to many of the same problems that keep litigators in new golf shoes. Namely, poor output and the potential for copyright infringement. Both things can result in a coder feeling less confident about using the code generated by GenAI.

The case for context

GitHub Copilot can help bolster the confidence in code, both in terms of its quality and in mitigating the risk of litigation by citing its sources. If a GenAI tool can display the original sources of the code it is using to generate its output, similar to how an online plagiarism checker links back to original source content, a developer would be in a better position to judge whether that code is from a trusted, friendly source, and not a litigious competitor or unreliable organization.

As Copilot learns how to produce creative output from the great source pool of data lakes and large language models (LLMs), and as the gray matter of its neural networks is further refined by upgrades, it and other GenAI platforms will no doubt smooth out the rough edges of their early days and produce increasingly seamless, more original creative output. As they do so, they will also further blur the lines of copyright infringement, facts, and pure fiction, much the way humans already do.

Creativity has always been somewhat derivative

Humans learn to create from mimicking their family, teachers, mentors and peers. Having absorbed all we can, we begin to produce our own creative work, which often begins with derivative attempts to find one’s own style. With lots of persistence and a bit of talent, we strive to develop in a unique style that sets us apart from our peers and earns us some recognition. This process is played out by musicians, writers, painters, and yes, software developers. As the saying goes, “There is nothing new under the sun.” But with a little creativity, we can put our own spin on our work.

In the creative arts, acts of copyright infringement—both conscious or not—occur quite regularly. There have been countless high-profile music industry copyright infringement cases involving claims of plagiarized pop songs (Marvin Gaye vs. Robin Thicke & Pharrell Williams; Spirit vs. Led Zeppelin; etc.)  and works of literary fiction that have sought to define what is protected by copyright law. The most egregious instances may be punished by legal action, especially where there is some compensation to be gained. The fact that these cases take so long to dispute says something about the quality of the derivative work. Were there very clear evidence of copyright infringement, there would be very short work for litigators. It is within the smoothness of the edges, the lines distinguishing one work from another, that originality exists. 

GenAI is not at the level of creativity that is required to produce truly brilliant, original work. And that is precisely where the developer skills come into play. 

The road to human confidence

Already, GitHub Copilot offers a “Suggestions matching public code” filter that helps the user avoid using direct copies of code snippets by checking surrounding code (within a 150-character limit). It’s a safeguard against blatant copying that can help to reduce liability for plagiarism. But only an experienced developer has the judgment to know when proposed code is ultimately usable. Between its reliance on poor sources to its proclivity for outright hallucinations, GenAI cannot be trusted to write code without human oversight. Its role as an assistant, however, is clearly valuable, particularly when asked the right questions in the right syntax. 

The potential for GitHub Copilot and other GenAI technologies to make light work of creative work from prose, poetry and song to executable computer code is increasing every day. What these technologies lack is the judgment to know when something is unique and of high quality. What these machines lack in confidence, they provide to the developer in insight and quickly generated ideas. Together, humans and AI can make a very impressive team.

The post GitHub Copilot has a confidence problem appeared first on SD Times.



from SD Times https://ift.tt/dw7gbZL

Comments

Popular posts from this blog

Difference between Web Designer and Web Developer Neeraj Mishra The Crazy Programmer

Have you ever wondered about the distinctions between web developers’ and web designers’ duties and obligations? You’re not alone! Many people have trouble distinguishing between these two. Although they collaborate to publish new websites on the internet, web developers and web designers play very different roles. To put these job possibilities into perspective, consider the construction of a house. To create a vision for the house, including the visual components, the space planning and layout, the materials, and the overall appearance and sense of the space, you need an architect. That said, to translate an idea into a building, you need construction professionals to take those architectural drawings and put them into practice. Image Source In a similar vein, web development and design work together to create websites. Let’s examine the major responsibilities and distinctions between web developers and web designers. Let’s get going, shall we? What Does a Web Designer Do?...

A guide to data integration tools

CData Software is a leader in data access and connectivity solutions. It specializes in the development of data drivers and data access technologies for real-time access to online or on-premise applications, databases and web APIs. The company is focused on bringing data connectivity capabilities natively into tools organizations already use. It also features ETL/ELT solutions, enterprise connectors, and data visualization. Matillion ’s data transformation software empowers customers to extract data from a wide number of sources, load it into their chosen cloud data warehouse (CDW) and transform that data from its siloed source state, into analytics-ready insights – prepared for advanced analytics, machine learning, and artificial intelligence use cases. Only Matillion is purpose-built for Snowflake, Amazon Redshift, Google BigQuery, and Microsoft Azure, enabling businesses to achieve new levels of simplicity, speed, scale, and savings. Trusted by companies of all sizes to meet...

2022: The year of hybrid work

Remote work was once considered a luxury to many, but in 2020, it became a necessity for a large portion of the workforce, as the scary and unknown COVID-19 virus sickened and even took the lives of so many people around the world.  Some workers were able to thrive in a remote setting, while others felt isolated and struggled to keep up a balance between their work and home lives. Last year saw the availability of life-saving vaccines, so companies were able to start having the conversation about what to do next. Should they keep everyone remote? Should they go back to working in the office full time? Or should they do something in between? Enter hybrid work, which offers a mix of the two. A Fall 2021 study conducted by Google revealed that over 75% of survey respondents expect hybrid work to become a standard practice within their organization within the next three years.  Thus, two years after the world abruptly shifted to widespread adoption of remote work, we are dec...