feat(project): implement AI project initialization and document parsing

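- Add the DocumentParserUtil utility class, supporting PDF, Word, Excel, Markdown, and plain-text parsing
- Implement OssService on top of MinIO, supporting file upload, download, deletion, and URL generation
- Add the ProjectService implementation, which uses the Spring AI ChatClient to parse project documents into structured data
- Add ProjectController, exposing a file upload endpoint used for project initialization
- Configure the development environment in application-dev.yaml, including database, MinIO, and Spring AI settings
- Add pom.xml, integrating the required dependencies such as Spring AI, MinIO, Apache POI, PDFBox, Tika, and Flexmark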
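
The DocumentParserUtil source is not part of the hunk shown here, so the following is only a rough sketch of how parsing by file type might be dispatched. The class name DocumentTypeSketch, the DocType enum, and the detect method are all hypothetical, and the sketch uses only the JDK; the real utility presumably hands each type to PDFBox, POI, Flexmark, or Tika.

```java
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch: maps a file name to the document type that a parser
// utility would handle. None of these names come from the commit itself.
public class DocumentTypeSketch {

    enum DocType { PDF, WORD, EXCEL, MARKDOWN, TEXT, UNKNOWN }

    // Extension-to-type table covering the formats listed in the commit message.
    private static final Map<String, DocType> BY_EXTENSION = Map.of(
            "pdf", DocType.PDF,
            "doc", DocType.WORD,
            "docx", DocType.WORD,
            "xls", DocType.EXCEL,
            "xlsx", DocType.EXCEL,
            "md", DocType.MARKDOWN,
            "markdown", DocType.MARKDOWN,
            "txt", DocType.TEXT);

    // Returns the type for a file name, or UNKNOWN when there is no usable extension.
    static DocType detect(String filename) {
        if (filename == null) {
            return DocType.UNKNOWN;
        }
        int dot = filename.lastIndexOf('.');
        if (dot < 0 || dot == filename.length() - 1) {
            return DocType.UNKNOWN;
        }
        String ext = filename.substring(dot + 1).toLowerCase(Locale.ROOT);
        return BY_EXTENSION.getOrDefault(ext, DocType.UNKNOWN);
    }

    public static void main(String[] args) {
        System.out.println(detect("requirements.PDF")); // PDF
        System.out.println(detect("notes.md"));         // MARKDOWN
        System.out.println(detect("archive.zip"));      // UNKNOWN
    }
}
```

Extension matching is the simplest option; content-based detection (the role tika-core usually plays) would be more robust against misnamed uploads.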
2026-03-26 17:59:18 +08:00
parent 4656090683
commit 852cbd60a0
6 changed files with 342 additions and 42 deletions

pom.xml (47 changed lines)

@@ -81,11 +81,50 @@
 <version>8.5.7</version>
 </dependency>
-<!-- aws-java, used for storage buckets -->
+<!-- Document parsing dependencies -->
+<!-- PDF parsing -->
 <dependency>
-<groupId>com.amazonaws</groupId>
-<artifactId>aws-java-sdk-s3</artifactId>
-<version>1.12.700</version>
+<groupId>org.apache.pdfbox</groupId>
+<artifactId>pdfbox</artifactId>
+<version>2.0.30</version>
 </dependency>
+<!-- Word document parsing -->
+<dependency>
+<groupId>org.apache.poi</groupId>
+<artifactId>poi-ooxml</artifactId>
+<version>5.2.5</version>
+</dependency>
+<dependency>
+<groupId>org.apache.poi</groupId>
+<artifactId>poi-scratchpad</artifactId>
+<version>5.2.5</version>
+</dependency>
+<!-- Excel parsing -->
+<dependency>
+<groupId>org.apache.poi</groupId>
+<artifactId>poi-ooxml-schemas</artifactId>
+<version>4.1.2</version>
+</dependency>
+<!-- Markdown parsing -->
+<dependency>
+<groupId>com.vladsch.flexmark</groupId>
+<artifactId>flexmark-all</artifactId>
+<version>0.64.8</version>
+</dependency>
+<!-- Tika, used for general document type detection and content extraction -->
+<dependency>
+<groupId>org.apache.tika</groupId>
+<artifactId>tika-core</artifactId>
+<version>2.9.1</version>
+</dependency>
+<dependency>
+<groupId>org.apache.tika</groupId>
+<artifactId>tika-parsers-standard-package</artifactId>
+<version>2.9.1</version>
+</dependency>
 <dependency>