IT Trends

[AI Agent] 웹사이트 글 정리해주는 AI 에이전트

728x90

1) 웹사이트 링크 데이터를 읽어와
2) FAISS 와 같은 벡터 스페이스에 저장한 다음
3) 웹사이트 데이터를 요약해주는 LLM 체인 생성
Streamlit 이용해 웹으로 확인

필요한 라이브러리 모듈 불러오기

import dotenv
dotenv.load_dotenv()

from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain_openai import OpenAIEmbeddings
from langchain_community.vectorstores import Chroma
from langchain_community.document_loaders import WebBaseLoader
from langchain.chains import RetrievalQA
from langchain_openai import ChatOpenAI
from langchain.prompts import PromptTemplate
import streamlit as st

웹사이트 링크 입력하는 부분 만들기 (steamlit)

# 웹사이트 제목
st.title("Summarizer")

def get_text() 
	"""웹사이트 링크 입력"""
    input_text = st.text_input("Type in the website below", key="input")
    return input_text

# 웹사이트 링크 받기
user_input = get_text()

LLM 생성 및 실행

# 웹사이트가 입력 되면
if(user_input) :
	# 웹사이트 데이터 읽어오기
    loader = WebBaseLoader(user_input)
    data = loader.load()
	
    # 읽어온 데이터를 chunking 하기
    text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
    all_splits = text_splitter.split_documents(data)

	# chunking 한 데이터를 vectordb에 저장
    vectorsotre = Chroma.from_documents(documents=all_splits, embedding=OpenAIEmbeddings())
    
    # LLM chain 생성
    question = "Summarize the main points of this texts"
    template = """Use the following pieces of context to answer the question at the end. 
    If you don't know the answer, just say that you don't know, don't try to make up an answer. 
    Return the result in Korean.
    {context}
    Question: {question}
    Helpful Answer:"""
    QA_CHAIN_PROMPT = PromptTemplate.from_template(template)

    llm = ChatOpenAI(model_name="gpt-3.5-turbo", temperature=0)
    qa_chain = RetrievalQA.from_chain_type(
        llm, 
        retriever=vectorsotre.as_retriever(),
        chain_type_kwargs={"prompt": QA_CHAIN_PROMPT}
    )
	
    # LLM Chain 실행
    result = qa_chain({"query" : question})
    result["result"]

REF :

심심해서 해커톤 참여해서 10분 안에 아이디어 만들고 논 후기 | Disquiet*

10분 안에 웹사이트 글 정리해주는 AI 에이전트 만들기안녕하세요! 오랜만에 개발글로 돌아왔습니다.오늘은 정말 간단하게 10분 안에웹사이트 링크 데이터를 스크레이프하고faiss와 같은 백터스

disquiet.io

저작자표시 비영리 변경금지

'IT Trends' 카테고리의 다른 글

[AI Agent] 코드 100줄로 쏘카 팀이 만든 고객 서비스 챗봇 만들기 (1)	2024.07.22
[IT Issues] Horangi / RAFT / LLM Guide book in 2024 / Podgenai (0)	2024.04.04
[IT Issues] supercomputer StargateGPT-5 / GPT-5 / DBRX / Mindlogic Chatbot / LogicKor / AQUA Voice (0)	2024.04.03
[IT Issues] Grok-1.5 / OpenVoice / Jamba / DUSt3R (1)	2024.04.01
LINER PDF Chat Tutorial (3) Pinecone + Mixtral (HuggigFace) (1)	2024.03.08

Contents

새소식

[AI Agent] 웹사이트 글 정리해주는 AI 에이전트

필요한 라이브러리 모듈 불러오기

웹사이트 링크 입력하는 부분 만들기 (steamlit)

LLM 생성 및 실행

'IT Trends' 카테고리의 다른 글

당신이 좋아할만한 콘텐츠

티스토리툴바