feat: Add Official Microsoft & Gemini Skills (845+ Total)

🚀 Impact Significantly expands the capabilities of **Antigravity Awesome Skills** by integrating official skill collections from **Microsoft** and **Google Gemini**. This update increases the total skill count to **845+**, making the library even more comprehensive for AI coding assistants. ✨ Key Changes 1. New Official Skills - **Microsoft Skills**: Added a massive collection of official skills from [microsoft/skills](https://github.com/microsoft/skills). - Includes Azure, .NET, Python, TypeScript, and Semantic Kernel skills. - Preserves the original directory structure under `skills/official/microsoft/`. - Includes plugin skills from the `.github/plugins` directory. - **Gemini Skills**: Added official Gemini API development skills under `skills/gemini-api-dev/`. 2. New Scripts & Tooling - **`scripts/sync_microsoft_skills.py`**: A robust synchronization script that: - Clones the official Microsoft repository. - Preserves the original directory heirarchy. - Handles symlinks and plugin locations. - Generates attribution metadata. - **`scripts/tests/inspect_microsoft_repo.py`**: Debug tool to inspect the remote repository structure. - **`scripts/tests/test_comprehensive_coverage.py`**: Verification script to ensure 100% of skills are captured during sync. 3. Core Improvements - **`scripts/generate_index.py`**: Enhanced frontmatter parsing to safely handle unquoted values containing `@` symbols and commas (fixing issues with some Microsoft skill descriptions). - **`package.json`**: Added `sync:microsoft` and `sync:all-official` scripts for easy maintenance. 4. Documentation - Updated `README.md` to reflect the new skill counts (845+) and added Microsoft/Gemini to the provider list. - Updated `CATALOG.md` and `skills_index.json` with the new skills. 🧪 Verification - Ran `scripts/tests/test_comprehensive_coverage.py` to verify all Microsoft skills are detected. - Validated `generate_index.py` fixes by successfully indexing the new skills.
2026-02-11 20:16:23 +05:00
parent 167d7c97c7
commit 17bce709de
145 changed files with 44081 additions and 72 deletions
--- a/skills/official/microsoft/python/foundry/contentunderstanding/SKILL.md
+++ b/skills/official/microsoft/python/foundry/contentunderstanding/SKILL.md
@@ -0,0 +1,273 @@
+---
+name: azure-ai-contentunderstanding-py
+description: |
+  Azure AI Content Understanding SDK for Python. Use for multimodal content extraction from documents, images, audio, and video.
+  Triggers: "azure-ai-contentunderstanding", "ContentUnderstandingClient", "multimodal analysis", "document extraction", "video analysis", "audio transcription".
+package: azure-ai-contentunderstanding
+---
+
+# Azure AI Content Understanding SDK for Python
+
+Multimodal AI service that extracts semantic content from documents, video, audio, and image files for RAG and automated workflows.
+
+## Installation
+
+```bash
+pip install azure-ai-contentunderstanding
+```
+
+## Environment Variables
+
+```bash
+CONTENTUNDERSTANDING_ENDPOINT=https://<resource>.cognitiveservices.azure.com/
+```
+
+## Authentication
+
+```python
+import os
+from azure.ai.contentunderstanding import ContentUnderstandingClient
+from azure.identity import DefaultAzureCredential
+
+endpoint = os.environ["CONTENTUNDERSTANDING_ENDPOINT"]
+credential = DefaultAzureCredential()
+client = ContentUnderstandingClient(endpoint=endpoint, credential=credential)
+```
+
+## Core Workflow
+
+Content Understanding operations are asynchronous long-running operations:
+
+1. **Begin Analysis** — Start the analysis operation with `begin_analyze()` (returns a poller)
+2. **Poll for Results** — Poll until analysis completes (SDK handles this with `.result()`)
+3. **Process Results** — Extract structured results from `AnalyzeResult.contents`
+
+## Prebuilt Analyzers
+
+| Analyzer | Content Type | Purpose |
+|----------|--------------|---------|
+| `prebuilt-documentSearch` | Documents | Extract markdown for RAG applications |
+| `prebuilt-imageSearch` | Images | Extract content from images |
+| `prebuilt-audioSearch` | Audio | Transcribe audio with timing |
+| `prebuilt-videoSearch` | Video | Extract frames, transcripts, summaries |
+| `prebuilt-invoice` | Documents | Extract invoice fields |
+
+## Analyze Document
+
+```python
+import os
+from azure.ai.contentunderstanding import ContentUnderstandingClient
+from azure.ai.contentunderstanding.models import AnalyzeInput
+from azure.identity import DefaultAzureCredential
+
+endpoint = os.environ["CONTENTUNDERSTANDING_ENDPOINT"]
+client = ContentUnderstandingClient(
+    endpoint=endpoint,
+    credential=DefaultAzureCredential()
+)
+
+# Analyze document from URL
+poller = client.begin_analyze(
+    analyzer_id="prebuilt-documentSearch",
+    inputs=[AnalyzeInput(url="https://example.com/document.pdf")]
+)
+
+result = poller.result()
+
+# Access markdown content (contents is a list)
+content = result.contents[0]
+print(content.markdown)
+```
+
+## Access Document Content Details
+
+```python
+from azure.ai.contentunderstanding.models import MediaContentKind, DocumentContent
+
+content = result.contents[0]
+if content.kind == MediaContentKind.DOCUMENT:
+    document_content: DocumentContent = content  # type: ignore
+    print(document_content.start_page_number)
+```
+
+## Analyze Image
+
+```python
+from azure.ai.contentunderstanding.models import AnalyzeInput
+
+poller = client.begin_analyze(
+    analyzer_id="prebuilt-imageSearch",
+    inputs=[AnalyzeInput(url="https://example.com/image.jpg")]
+)
+result = poller.result()
+content = result.contents[0]
+print(content.markdown)
+```
+
+## Analyze Video
+
+```python
+from azure.ai.contentunderstanding.models import AnalyzeInput
+
+poller = client.begin_analyze(
+    analyzer_id="prebuilt-videoSearch",
+    inputs=[AnalyzeInput(url="https://example.com/video.mp4")]
+)
+
+result = poller.result()
+
+# Access video content (AudioVisualContent)
+content = result.contents[0]
+
+# Get transcript phrases with timing
+for phrase in content.transcript_phrases:
+    print(f"[{phrase.start_time} - {phrase.end_time}]: {phrase.text}")
+
+# Get key frames (for video)
+for frame in content.key_frames:
+    print(f"Frame at {frame.time}: {frame.description}")
+```
+
+## Analyze Audio
+
+```python
+from azure.ai.contentunderstanding.models import AnalyzeInput
+
+poller = client.begin_analyze(
+    analyzer_id="prebuilt-audioSearch",
+    inputs=[AnalyzeInput(url="https://example.com/audio.mp3")]
+)
+
+result = poller.result()
+
+# Access audio transcript
+content = result.contents[0]
+for phrase in content.transcript_phrases:
+    print(f"[{phrase.start_time}] {phrase.text}")
+```
+
+## Custom Analyzers
+
+Create custom analyzers with field schemas for specialized extraction:
+
+```python
+# Create custom analyzer
+analyzer = client.create_analyzer(
+    analyzer_id="my-invoice-analyzer",
+    analyzer={
+        "description": "Custom invoice analyzer",
+        "base_analyzer_id": "prebuilt-documentSearch",
+        "field_schema": {
+            "fields": {
+                "vendor_name": {"type": "string"},
+                "invoice_total": {"type": "number"},
+                "line_items": {
+                    "type": "array",
+                    "items": {
+                        "type": "object",
+                        "properties": {
+                            "description": {"type": "string"},
+                            "amount": {"type": "number"}
+                        }
+                    }
+                }
+            }
+        }
+    }
+)
+
+# Use custom analyzer
+from azure.ai.contentunderstanding.models import AnalyzeInput
+
+poller = client.begin_analyze(
+    analyzer_id="my-invoice-analyzer",
+    inputs=[AnalyzeInput(url="https://example.com/invoice.pdf")]
+)
+
+result = poller.result()
+
+# Access extracted fields
+print(result.fields["vendor_name"])
+print(result.fields["invoice_total"])
+```
+
+## Analyzer Management
+
+```python
+# List all analyzers
+analyzers = client.list_analyzers()
+for analyzer in analyzers:
+    print(f"{analyzer.analyzer_id}: {analyzer.description}")
+
+# Get specific analyzer
+analyzer = client.get_analyzer("prebuilt-documentSearch")
+
+# Delete custom analyzer
+client.delete_analyzer("my-custom-analyzer")
+```
+
+## Async Client
+
+```python
+import asyncio
+import os
+from azure.ai.contentunderstanding.aio import ContentUnderstandingClient
+from azure.ai.contentunderstanding.models import AnalyzeInput
+from azure.identity.aio import DefaultAzureCredential
+
+async def analyze_document():
+    endpoint = os.environ["CONTENTUNDERSTANDING_ENDPOINT"]
+    credential = DefaultAzureCredential()
+    
+    async with ContentUnderstandingClient(
+        endpoint=endpoint,
+        credential=credential
+    ) as client:
+        poller = await client.begin_analyze(
+            analyzer_id="prebuilt-documentSearch",
+            inputs=[AnalyzeInput(url="https://example.com/doc.pdf")]
+        )
+        result = await poller.result()
+        content = result.contents[0]
+        return content.markdown
+
+asyncio.run(analyze_document())
+```
+
+## Content Types
+
+| Class | For | Provides |
+|-------|-----|----------|
+| `DocumentContent` | PDF, images, Office docs | Pages, tables, figures, paragraphs |
+| `AudioVisualContent` | Audio, video files | Transcript phrases, timing, key frames |
+
+Both derive from `MediaContent` which provides basic info and markdown representation.
+
+## Model Imports
+
+```python
+from azure.ai.contentunderstanding.models import (
+    AnalyzeInput,
+    AnalyzeResult,
+    MediaContentKind,
+    DocumentContent,
+    AudioVisualContent,
+)
+```
+
+## Client Types
+
+| Client | Purpose |
+|--------|---------|
+| `ContentUnderstandingClient` | Sync client for all operations |
+| `ContentUnderstandingClient` (aio) | Async client for all operations |
+
+## Best Practices
+
+1. **Use `begin_analyze` with `AnalyzeInput`** — this is the correct method signature
+2. **Access results via `result.contents[0]`** — results are returned as a list
+3. **Use prebuilt analyzers** for common scenarios (document/image/audio/video search)
+4. **Create custom analyzers** only for domain-specific field extraction
+5. **Use async client** for high-throughput scenarios with `azure.identity.aio` credentials
+6. **Handle long-running operations** — video/audio analysis can take minutes
+7. **Use URL sources** when possible to avoid upload overhead