{"id":2215,"date":"2025-03-05T07:04:33","date_gmt":"2025-03-05T07:04:33","guid":{"rendered":"https:\/\/mailitics.com\/index.php\/2025\/03\/05\/deep-research-by-openai-a-practical-test-of-ai-powered-literature-review\/"},"modified":"2025-03-05T07:04:33","modified_gmt":"2025-03-05T07:04:33","slug":"deep-research-by-openai-a-practical-test-of-ai-powered-literature-review","status":"publish","type":"post","link":"https:\/\/mailitics.com\/index.php\/2025\/03\/05\/deep-research-by-openai-a-practical-test-of-ai-powered-literature-review\/","title":{"rendered":"Deep Research by OpenAI: A Practical Test of AI-Powered Literature Review"},"content":{"rendered":"<div>\n<p class=\"wp-block-paragraph\" id=\"3091\">\u201cConduct a comprehensive literature review on the state-of-the-art in Machine Learning and energy consumption. [\u2026]\u201d<\/p>\n<p class=\"wp-block-paragraph\" id=\"9e88\">With this prompt, I tested the new Deep Research function, which is powered by a version of OpenAI\u2019s o3 reasoning model and has been available to Plus users since the end of February, and received a state-of-the-art literature review within 6 minutes.<\/p>\n<p class=\"wp-block-paragraph\" id=\"371b\">This function goes beyond a normal web search (for example, with GPT-4o in ChatGPT): the research query is broken down and structured, the web is searched for information, the results are evaluated, and finally a structured, comprehensive report is created.<\/p>\n<p class=\"wp-block-paragraph\" id=\"c690\">Let\u2019s take a closer look at this.<\/p>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\" id=\"5a15\"><strong>Table of Contents<\/strong><br \/><a href=\"https:\/\/towardsdatascience.com\/#what-is\">1. 
What is Deep Research from OpenAI and what can you do with it?<\/a><br \/><a href=\"https:\/\/towardsdatascience.com\/#how-does\">2. How does Deep Research work?<\/a><br \/><a href=\"https:\/\/towardsdatascience.com\/#how-to-use\">3. How you can use Deep Research: A practical example<\/a><br \/><a href=\"https:\/\/towardsdatascience.com\/#challenges\">4. Challenges and risks of the Deep Research feature<\/a><br \/><a href=\"https:\/\/towardsdatascience.com\/#final-thoughts\">Final thoughts<\/a><br \/><a href=\"https:\/\/towardsdatascience.com\/#where-can\">Where can you continue learning?<\/a><\/p>\n<\/blockquote>\n<h2 class=\"wp-block-heading\" id=\"what-is\">1. What is Deep Research from OpenAI and what can you do with it?<\/h2>\n<p class=\"wp-block-paragraph\" id=\"e6b1\">If you have a ChatGPT Plus account (the $20 per month plan), you have access to Deep Research with 10 queries per month. The Pro subscription ($200 per month) gives you extended access with 120 Deep Research queries per month, plus access to the research preview of GPT-4.5.<\/p>\n<p class=\"wp-block-paragraph\" id=\"e696\"><a href=\"https:\/\/help.openai.com\/en\/articles\/10500283-deep-research-faq\" target=\"_blank\" rel=\"noreferrer noopener\">OpenAI<\/a>\u00a0promises that we can perform multi-step research using data from the public web.<\/p>\n<p class=\"wp-block-paragraph\" id=\"e1c3\">Duration: 5 to 30 minutes, depending on complexity.\u00a0<\/p>\n<p class=\"wp-block-paragraph\" id=\"3a3d\">Done manually, such research usually takes hours.<\/p>\n<p class=\"wp-block-paragraph\" id=\"9140\">It is intended for complex tasks that require a deep, thorough search.<\/p>\n<p class=\"wp-block-paragraph\" id=\"d4b8\">What do concrete use cases look like?<\/p>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">Literature review: Conduct a literature review on the state of the art in machine learning and energy consumption.<\/li>\n<li 
class=\"wp-block-list-item\">Market analysis: Create a comparative report on the best marketing automation platforms for companies in 2025 based on current market trends and evaluations.<\/li>\n<li class=\"wp-block-list-item\">Technology &amp; software development: Investigate programming languages and frameworks for AI application development, including an analysis of performance and use cases.<\/li>\n<li class=\"wp-block-list-item\">Investment &amp; financial analysis: Conduct research on the impact of AI-powered trading on the financial market based on recent reports and academic studies.<\/li>\n<li class=\"wp-block-list-item\">Legal research: Compile an overview of data protection laws in Europe compared to the US, including relevant rulings and recent changes.<\/li>\n<\/ul>\n<h2 class=\"wp-block-heading\" id=\"how-does\">2. How does Deep Research work?<\/h2>\n<p class=\"wp-block-paragraph\" id=\"613f\">Deep Research uses various <a href=\"https:\/\/towardsdatascience.com\/tag\/deep-learning\/\" title=\"Deep Learning\">Deep Learning<\/a> methods to carry out a systematic and detailed analysis of information. The entire process can be divided into four main phases:<\/p>\n<h3 class=\"wp-block-heading\" id=\"e9e3\">1. Decomposition and structuring of the research question<\/h3>\n<p class=\"wp-block-paragraph\" id=\"79e3\">In the first step, the tool processes the research question using natural language processing (NLP) methods. It identifies the most important key terms, concepts, and sub-questions.\u00a0<\/p>\n<p class=\"wp-block-paragraph\" id=\"7739\">This step ensures that the AI understands the question not only literally but also in terms of its meaning.<\/p>\n<h3 class=\"wp-block-heading\" id=\"2f3d\">2. Obtaining relevant information<\/h3>\n<p class=\"wp-block-paragraph\" id=\"9710\">Once the tool has structured the research question, it searches for information in a targeted way. 
<a href=\"https:\/\/towardsdatascience.com\/tag\/deep-research\/\" title=\"Deep Research\">Deep Research<\/a> uses a mixture of internal databases, scientific publications, APIs, and web scraping. Sources can include open-access databases such as arXiv, PubMed, or Semantic Scholar, as well as public websites and news sites such as The Guardian, The New York Times, or the BBC. In short, it can draw on any content that is publicly accessible online.<\/p>\n<h3 class=\"wp-block-heading\" id=\"bae4\">3. Analysis &amp; interpretation of the data<\/h3>\n<p class=\"wp-block-paragraph\" id=\"02f6\">In the next step, the AI model condenses large amounts of text into compact and understandable answers. Transformers and attention mechanisms ensure that the most important information is prioritized, so the result is not simply a summary of all the content found. The quality and credibility of the sources are also assessed, and cross-validation methods are normally used to identify incorrect or contradictory information by comparing several sources with each other. However, it is not publicly known exactly how Deep Research does this or which criteria it applies.<\/p>\n<h3 class=\"wp-block-heading\" id=\"9bc9\">4. Generation of the final report<\/h3>\n<p class=\"wp-block-paragraph\" id=\"3cf5\">Finally, the report is generated and displayed to us. This is done using Natural Language Generation (NLG), so that the text we see is easy to read.<\/p>\n<p class=\"wp-block-paragraph\" id=\"5cf2\">The AI system generates diagrams or tables if requested in the prompt and adapts the response to the user\u2019s style. The primary sources used are also listed at the end of the report.<\/p>\n<h2 class=\"wp-block-heading\" id=\"how-to-use\">3. 
How you can use Deep Research: A practical example<\/h2>\n<p class=\"wp-block-paragraph\" id=\"4e12\">As a first step, it is best to ask one of the standard models how to optimize your prompt for Deep Research. I did this with GPT-4o, using the following prompt:<\/p>\n<p class=\"wp-block-paragraph\" id=\"8394\"><em>\u201cOptimize this prompt to conduct a deep research:<br \/>Carrying out a literature search: Carry out a literature search on the state of the art on machine learning and energy consumption.\u201d<\/em><\/p>\n<p class=\"wp-block-paragraph\" id=\"c20c\">GPT-4o suggested the following prompt for the Deep Research function:<\/p>\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" data-dominant-color=\"eaeaeb\" data-has-transparency=\"true\" style=\"--dominant-color: #eaeaeb;\" fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"434\" src=\"https:\/\/i0.wp.com\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-1-1024x434.png?resize=1024%2C434&#038;ssl=1\" alt=\"Deep Research screenshot (German and English)\" class=\"wp-image-598700 has-transparency\" srcset=\"https:\/\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-1-1024x434.png 1024w, https:\/\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-1-300x127.png 300w, https:\/\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-1-768x326.png 768w, https:\/\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-1.png 1175w\" sizes=\"(max-width: 1024px) 100vw, 1024px\"><figcaption class=\"wp-element-caption\">Screenshot taken by the author<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\" id=\"71cf\">The tool then asked me if I could clarify the scope and focus of the literature review. 
I therefore provided some additional details:<\/p>\n<figure class=\"wp-block-image size-full\"><img data-recalc-dims=\"1\" loading=\"lazy\" data-dominant-color=\"f3f3f3\" data-has-transparency=\"true\" style=\"--dominant-color: #f3f3f3;\" decoding=\"async\" width=\"872\" height=\"776\" src=\"https:\/\/i0.wp.com\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-2.png?resize=872%2C776&#038;ssl=1\" alt=\"Deep research screenshot\" class=\"wp-image-598701 has-transparency\" srcset=\"https:\/\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-2.png 872w, https:\/\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-2-300x267.png 300w, https:\/\/towardsdatascience.com\/wp-content\/uploads\/2025\/03\/What-is-Deep-Research-2-768x683.png 768w\" sizes=\"(max-width: 872px) 100vw, 872px\"><figcaption class=\"wp-element-caption\">Screenshot taken by the author<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\" id=\"d398\">ChatGPT then confirmed the clarified scope and started the research.<\/p>\n<p class=\"wp-block-paragraph\" id=\"82ff\">While it ran, I could follow the progress and watch more sources being added.<\/p>\n<p class=\"wp-block-paragraph\" id=\"bb41\">After 6 minutes, the state-of-the-art literature review was complete, and the report, including all sources, was available to me.<\/p>\n<h3 class=\"wp-block-heading\"><a href=\"https:\/\/drive.google.com\/file\/d\/1GHAi1eMxlzW2YDpBqp5szTD78_hMf_jF\/view?usp=sharing&amp;source=post_page-----08c5c2395df4---------------------------------------\" target=\"_blank\" rel=\"noreferrer noopener\">Deep Research Example.mp4<\/a><\/h3>\n<h2 class=\"wp-block-heading\" id=\"challenges\">4. 
Challenges and risks of the Deep Research feature<\/h2>\n<p class=\"wp-block-paragraph\" id=\"2dc6\">Let\u2019s take a look at two definitions of research:<\/p>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\">\u201cA detailed study of a subject, especially in order to discover new information or reach a new understanding.\u201d<\/p>\n<p><cite><a href=\"https:\/\/dictionary.cambridge.org\/de\/worterbuch\/englisch\/research#google_vignette\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Reference: Cambridge Dictionary<\/em><\/a><\/cite>\n<\/p><\/blockquote>\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p class=\"wp-block-paragraph\">\u201cResearch is creative and systematic work undertaken to increase the stock of knowledge. It involves the collection, organization, and analysis of evidence to increase understanding of a topic, characterized by a particular attentiveness to controlling sources of bias and error.\u201d<\/p>\n<p><cite><a href=\"https:\/\/en.wikipedia.org\/wiki\/Research\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Reference: Wikipedia Research<\/em><\/a><\/cite>\n<\/p><\/blockquote>\n<p class=\"wp-block-paragraph\" id=\"ea1a\">The two definitions show that research is a detailed, systematic investigation of a topic, with the aim of discovering new information or achieving a deeper understanding.<\/p>\n<p class=\"wp-block-paragraph\" id=\"8058\">The Deep Research function fulfills these definitions to a certain extent: it collects existing information, analyzes it, and presents it in a structured way.<\/p>\n<p class=\"wp-block-paragraph\" id=\"8b27\">However, I think we also need to be aware of some challenges and risks:<\/p>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\">\n<strong>Danger of superficiality<\/strong>: Deep Research is primarily designed to efficiently 
search, summarize, and provide existing information in a structured form (at least at the current stage). That is great for getting an overview. But what about digging deeper? Real scientific research goes beyond mere reproduction and takes a critical look at the sources. Science also thrives on generating new knowledge.<\/li>\n<li class=\"wp-block-list-item\">\n<strong>Reinforcement of existing biases in research &amp; publication<\/strong>: Papers are already more likely to be published if they have significant results. \u201cNon-significant\u201d or contradictory results, on the other hand, are less likely to be published. This is known as\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Publication_bias\" target=\"_blank\" rel=\"noreferrer noopener\">publication bias<\/a>. If the AI tool primarily evaluates and cites frequently cited papers, it reinforces this trend, and rare or less widespread but possibly important findings may be lost. A possible solution would be a mechanism for weighted source evaluation that also takes into account less cited but relevant papers. Presumably, this effect also applies to us humans.<\/li>\n<li class=\"wp-block-list-item\">\n<strong>Quality of research papers<\/strong>: While it is obvious that a bachelor\u2019s, master\u2019s, or doctoral thesis cannot be based solely on AI-generated research, my question is how universities and scientific institutions will deal with this development. Students can get a solid research report with just a single prompt. 
The solution is presumably to adapt assessment criteria so that they give greater weight to in-depth reflection and methodology.<\/li>\n<\/ul>\n<h2 class=\"wp-block-heading\" id=\"final-thoughts\">Final thoughts<\/h2>\n<p class=\"wp-block-paragraph\" id=\"8796\">In addition to <a href=\"https:\/\/towardsdatascience.com\/tag\/openai\/\" title=\"OpenAI\">OpenAI<\/a>, other companies and platforms have integrated similar functions (some even before OpenAI). For example,\u00a0<a href=\"https:\/\/www.perplexity.ai\/de\/hub\/blog\/introducing-perplexity-deep-research\" rel=\"noreferrer noopener\" target=\"_blank\">Perplexity AI<\/a>\u00a0has introduced a deep research function that independently conducts and analyzes searches.\u00a0<a href=\"https:\/\/blog.google\/products\/gemini\/google-gemini-deep-research\/\" rel=\"noreferrer noopener\" target=\"_blank\">Gemini by Google<\/a>\u00a0has also integrated such a deep research function.<\/p>\n<p class=\"wp-block-paragraph\" id=\"d19b\">The function gives you an incredibly quick overview of an initial research question. It remains to be seen how reliable the results are. As of early March 2025,\u00a0<a href=\"https:\/\/openai.com\/index\/introducing-deep-research\/\" rel=\"noreferrer noopener\" target=\"_blank\">OpenAI itself lists as limitations<\/a>\u00a0that the feature is still at an early stage, can sometimes hallucinate facts or draw false conclusions, and has trouble distinguishing authoritative information from rumors. In addition, it is currently unable to accurately convey uncertainty.<\/p>\n<p class=\"wp-block-paragraph\" id=\"5e35\">Still, it can be assumed that this function will be expanded further and become a powerful tool for research. 
If you have simpler questions, it is better to use the standard GPT-4o model (with or without search), where you get an immediate answer.<\/p>\n<h2 class=\"wp-block-heading\" id=\"where-can\">Where can you continue learning?<\/h2>\n<ul class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/www.datacamp.com\/blog\/deep-research-openai\" target=\"_blank\" rel=\"noreferrer noopener\">DataCamp Blog \u2014 OpenAI\u2019s Deep Research<\/a><\/li>\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/www.ibm.com\/think\/news\/openai-releases-deep-research\" target=\"_blank\" rel=\"noreferrer noopener\">IBM \u2014 OpenAI\u2019s deep research aims to outthink analysts<\/a><\/li>\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/help.openai.com\/en\/articles\/10500283-deep-research-faq\" target=\"_blank\" rel=\"noreferrer noopener\">OpenAI \u2014 Deep Research FAQ<\/a><\/li>\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/cdn.openai.com\/deep-research-system-card.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">OpenAI \u2014 System Card PDF<\/a><\/li>\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/openai.com\/index\/deep-research-system-card\/\" target=\"_blank\" rel=\"noreferrer noopener\">OpenAI \u2014 System Card Deep Research<\/a><\/li>\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/openai.com\/index\/introducing-deep-research\/\" target=\"_blank\" rel=\"noreferrer noopener\">OpenAI \u2014 Introducing deep research<\/a><\/li>\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/www.youtube.com\/watch?v=onU5Hbb3qao\" target=\"_blank\" rel=\"noreferrer noopener\">freeCodeCamp Video \u2014 Understanding Deep Learning Research Tutorial<\/a><\/li>\n<li class=\"wp-block-list-item\"><a href=\"https:\/\/www.datacamp.com\/tutorial\/how-transformers-work\" target=\"_blank\" rel=\"noreferrer noopener\">DataCamp Blog \u2014 How Transformers Work<\/a><\/li>\n<\/ul>\n<p class=\"wp-block-paragraph\" id=\"6886\"><em>Want more tips &amp; 
tricks about tech, Python, data science, data engineering, machine learning and AI? Then regularly receive a summary of my most-read articles on my Substack \u2014 curated and for free. <\/em><\/p>\n<p class=\"wp-block-paragraph\" id=\"6886\"><a href=\"https:\/\/sarahleaschrch.substack.com\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Click here to subscribe to my Substack!<\/em><\/a><\/p>\n<p>The post <a href=\"https:\/\/towardsdatascience.com\/deep-research-by-openai-a-practical-test-of-ai-powered-literature-review\/\">Deep Research by OpenAI: A Practical Test of AI-Powered Literature Review<\/a> appeared first on <a href=\"https:\/\/towardsdatascience.com\/\">Towards Data Science<\/a>.<\/p>\n<\/div>\n<p>Sarah Lea<br \/>\n<a href=\"https:\/\/towardsdatascience.com\/deep-research-by-openai-a-practical-test-of-ai-powered-literature-review\/\">Go to original source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Deep Research by OpenAI: A Practical Test of AI-Powered Literature Review \u201cConduct a comprehensive literature review on the state-of-the-art in Machine Learning and energy consumption. 
[\u2026]\u201d With this prompt, I tested the new Deep Research function, which has been integrated into the OpenAI o3 reasoning model since the end of February\u200a\u2014\u200aand conducted a state-of-the-art literature [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[62,69,1929,88,250,1930,800],"tags":[4,1244,541],"class_list":["post-2215","post","type-post","status-publish","format-standard","hentry","category-aimldsaimlds","category-artificial-intelligence","category-deep-research","category-deep-learning","category-generative-ai-tools","category-llm-applications","category-openai","tag-deep","tag-openai","tag-research"],"_links":{"self":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/2215"}],"collection":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/comments?post=2215"}],"version-history":[{"count":0,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/2215\/revisions"}],"wp:attachment":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/media?parent=2215"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/categories?post=2215"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/tags?post=2215"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}