OpenAI introduces web crawlers in preparation for GPT-5

Open AI introduced a web crawling tool named ‘GPTBot’ with the aim of enhancing the functionality of future GPT models.

According to the company, the data was collected in the following ways. GPTBot It has the potential to improve the model’s accuracy and extend its capabilities, and could be an important step in the evolution of AI-powered language models.

Web crawlers (also called web spiders) play a key role in indexing content on the vast Internet. Popular search engines such as Google and Bing make use of these bots to add relevant web pages to their search results.

OpenAI’s GPTBot has a clear purpose. Collect public data while carefully avoiding sources with content that violates paywalls, personal data collection, or OpenAI policies.

Website owners can prevent GPTBot from crawling their sites by simply implementing a “disallow” command in their standard server files. This allows you to control which parts of your content web crawlers can access.

OpenAI’s announcement comes shortly after the company filed a trademark application for “GPT-5,” which is expected to be the successor to the current GPT-4 model.

The application, filed with the U.S. Patent and Trademark Office on July 18, includes the use of GPT-5 in AI-based human speech and text, speech-to-text conversion, speech recognition, and speech synthesis. ing.

But while GPT-5’s trademark filing sparked excitement among AI enthusiasts, OpenAI CEO Sam Altman cautioned against premature expectations. Altman revealed that the company still exists. Far from starting GPT-5 trainingThis is because extensive safety audits must be carried out before embarking on the process.

OpenAI’s recent efforts have been accompanied by controversy. Concerns have arisen over the company’s data collection practices, particularly copyright and consent issues.

In June, Japan’s privacy regulator issued a warning to OpenAI over fraudulent data collection.in Italy earlier this year. temporarily banned Using ChatGPT for suspected violations of European Union privacy laws.

OpenAI and Microsoft are also currently facing class action lawsuit The lawsuit was filed by 16 plaintiffs who allege that personal information from ChatGPT user interactions was accessed without proper consent.Businesses are also taking a hit lawsuit In GitHub Copilot, plaintiffs allege that the code generation tool violated developers’ rights by scraping code without attribution.

If these allegations are found to be true, both OpenAI and Microsoft could be found to have violated the Computer Fraud and Abuse Act, a legal precedent related to web scraping cases.

As OpenAI continues to push the boundaries of AI technology, these challenges must be overcome to ensure responsible and ethical development in the AI ​​field.

(Image credit: Gerd Altmann from pixabay)

See also Meta Announces Llama 2 Open Source LLM

Want to learn more about AI and big data from industry leaders? check out AI・Big Data EXPO It will be held in Amsterdam, California and London.The event will be held at the same time as digital transformation week.

Check out other upcoming enterprise technology events and webinars from TechForge here.

  • Ryan Dawes

    Ryan is a senior editor at TechForge Media with over a decade of experience covering emerging technologies and interviewing key figures in the industry. He’s often seen at tech conferences with a strong cup of coffee in one hand and a laptop in the other. If it’s something nerdy, he’s probably into it. Find him on Twitter (@Gadget_Ry) or Mastodon (@gadgetry@techhub.social).

tag: love, artificial intelligence, gpt-5, gptbot, open night, web crawler, spiderweb

https://www.artificialintelligence-news.com/2023/08/08/openai-deploys-web-crawler-preparation-gpt-5/ OpenAI introduces web crawlers in preparation for GPT-5

Show More
Back to top button