Use this identifier to cite or link to this item: https://repositorio.ufpe.br/handle/123456789/58332


Full metadata record
DC Field: Value [Language]
dc.contributor.advisor: Ren, Tsang Ing
dc.contributor.author: SILVA, Weybson Alves da
dc.date.accessioned: 2024-10-30T11:38:23Z
dc.date.available: 2024-10-30T11:38:23Z
dc.date.issued: 2024-10-09
dc.date.submitted: 2024-10-29
dc.identifier.citation: SILVA, Weybson Alves da. Evaluation of Large Language Models in Contract Information Extraction. 2024. Trabalho de Conclusão de Curso (Ciência da Computação) – Universidade Federal de Pernambuco, Recife, 2024. [pt_BR]
dc.identifier.uri: https://repositorio.ufpe.br/handle/123456789/58332
dc.description.abstract: Despite the rapid advancement of Large Language Models (LLMs), there is limited research focused on their effectiveness in extracting specific information from contracts. This study evaluates the effectiveness of state-of-the-art models—GPT-3.5-Turbo, Gemini-1.5-Pro, Claude-3.5-Sonnet, and Llama-3-70B-Instruct—in extracting key clauses from contracts using the Contract Understanding Atticus Dataset (CUAD). We explore the impact of prompting strategies and input context configurations across two scenarios: one covering all 41 clause categories and another focusing on a subset of three. Our findings reveal that LLMs can extract contract information efficiently, outperforming traditional human review in terms of time and cost. Performance, however, varies significantly depending on context size and task specificity, with reduced-context approaches and focused extractions often improving recall at the expense of precision. Notably, Claude-3.5-Sonnet, using zero-shot prompting with an output example and reduced context, achieved a recall of 0.77 and precision of 0.66, surpassing prior benchmarks on full-category extraction. However, performance is inconsistent across clause types. Models like Llama-3-70B-Instruct, while less robust, demonstrated strong performance on simpler tasks, highlighting their potential in targeted use cases. Additionally, retrieval-augmented generation shows potential for improving extraction and efficiency in long documents, though its performance is constrained by retriever accuracy. Our experiments suggest that with further refinement, LLMs could be vital in automating complex legal tasks, particularly in efficiently handling dense legal texts such as contracts. [pt_BR]
dc.format.extent: 28 p. [pt_BR]
dc.language.iso: eng [pt_BR]
dc.rights: openAccess [pt_BR]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/br/ [*]
dc.subject: Information Extraction [pt_BR]
dc.subject: Contract Review [pt_BR]
dc.subject: Large Language Models [pt_BR]
dc.subject: Natural Language Processing [pt_BR]
dc.title: Evaluation of Large Language Models in Contract Information Extraction [pt_BR]
dc.type: bachelorThesis [pt_BR]
dc.contributor.authorLattes: https://lattes.cnpq.br/5639027966274673 [pt_BR]
dc.degree.level: Graduacao [pt_BR]
dc.contributor.advisorLattes: http://lattes.cnpq.br/3084134533707587 [pt_BR]
dc.subject.cnpq: Áreas::Ciências Exatas e da Terra::Ciência da Computação [pt_BR]
dc.degree.departament: ::(CIN-DCC) - Departamento de Ciência da Computação [pt_BR]
dc.degree.graduation: ::CIn-Curso de Ciência da Computação [pt_BR]
dc.degree.grantor: Universidade Federal de Pernambuco [pt_BR]
dc.degree.local: Recife [pt_BR]
Appears in collections: (TCC) - Ciência da Computação

Files associated with this item:
File: TCC Weybson Alves da Silva.pdf | Size: 3,15 MB | Format: Adobe PDF


This file is protected by copyright



This item is licensed under a Creative Commons License