forked from kubernetes-sigs/gateway-api-inference-extension
site_name: Kubernetes Gateway API Inference Extension
repo_url: https://github.com/kubernetes-sigs/gateway-api-inference-extension
repo_name: kubernetes-sigs/gateway-api-inference-extension
site_dir: site
docs_dir: site-src
extra_css:
  - stylesheets/extra.css
theme:
  name: material
  icon:
    repo: fontawesome/brands/git-alt
  logo: images/logo/logo-text-large-horizontal-white.png
  favicon: images/favicon-64.png
  features:
    - search.highlight
    - navigation.tabs
    - navigation.top
    - navigation.expand
  palette:
    primary: custom
  custom_dir: site-src/overrides
edit_uri: edit/main/site-src/
plugins:
  - search
  - awesome-pages
  - macros:
      j2_line_comment_prefix: "#$"
  - mermaid2
markdown_extensions:
  - admonition
  - meta
  - pymdownx.emoji:
      emoji_index: !!python/name:material.extensions.emoji.twemoji
      emoji_generator: !!python/name:material.extensions.emoji.to_svg
  - pymdownx.details
  - pymdownx.highlight
  - pymdownx.inlinehilite
  - pymdownx.superfences:
      custom_fences:
        - name: mermaid
          class: mermaid
          format: !!python/name:pymdownx.superfences.fence_code_format
  - pymdownx.snippets
  - toc:
      permalink: true
  - tables
  - pymdownx.superfences
  - pymdownx.tabbed:
      alternate_style: true
nav:
  - Overview:
    - Introduction: index.md
    - Concepts:
        API Overview: concepts/api-overview.md
        Design Principles: concepts/design-principles.md
        Conformance: concepts/conformance.md
        Roles and Personas: concepts/roles-and-personas.md
    - Implementations:
      - Gateways: implementations/gateways.md
      - Model Servers: implementations/model-servers.md
    - FAQ: faq.md
  - Guides:
    - User Guides:
      - Getting started: guides/index.md
      - Use Cases:
        - Serve Multiple GenAI models: guides/serve-multiple-genai-models.md
        - Serve Multiple LoRA adapters: guides/serve-multiple-lora-adapters.md
      - Rollout:
        - Adapter Rollout: guides/adapter-rollout.md
        - InferencePool Rollout: guides/inferencepool-rollout.md
      - Metrics and Observability: guides/metrics-and-observability.md
      - Configuration Guide:
        - Configuring the plugins via configuration files or text: guides/epp-configuration/config-text.md
        - Prefix Cache Aware Plugin: guides/epp-configuration/prefix-aware.md
    - Implementer Guides:
      - Getting started: guides/implementers.md
      - Conformance Tests: guides/conformance-tests.md
  - Performance:
    - Benchmark: performance/benchmark/index.md
    - Regression Testing: performance/regression-testing/index.md
  - Reference:
    - API Reference: reference/spec.md
    - API Types:
      - InferencePool: api-types/inferencepool.md
      - InferenceModel: api-types/inferencemodel.md
  - Enhancements:
    - Overview: gieps/overview.md
  - Contributing:
    - How to Get Involved: contributing/index.md