Benchmark Results
Task-level results from our evaluation of Coral vs direct provider MCPs. Read the full report.
| ID | Task | Runner | Latency | Tools | Tokens | Cost | Facts |
|---|---|---|---|---|---|---|---|
| 1 | Do we have any duplicate dashboard names in our monitoring tool? If so, list them with their dashboard IDs. Simple Single source | MCP | 14s | 4 | 54,017 | $0.08 | 2/2 |
| Coral | 15.4s | 6 | 65,984 | $0.08 | 2/2 | ||
| Delta | 10% | 50% | 22% | -8% | |||
| 2 | How well are we estimating our work in the project management tool? What percentage of issues have story point estimates? Are there teams or priorities where estimation is better or worse? Complex Single source | MCP | 157.2s | 36 | 608,636 | $0.71 | 3/3 |
| Coral | 81.7s | 15 | 115,369 | $0.17 | 3/3 | ||
| Delta | -48% | -58% | -81% | -76% | |||
| 3 | What hosts do we have in our infrastructure? Describe the naming patterns and identify any that are specifically for titaness. Complex Single source | MCP | 76s | 44 | 1,241,588 | $0.33 | 2/2 |
| Coral | 38s | 10 | 97,580 | $0.16 | 2/2 | ||
| Delta | -50% | -77% | -92% | -53% | |||
| 4 | What strategic initiatives are we tracking in our project management tool? Do any of them have descriptions? Simple Single source | MCP | 16.9s | 4 | 53,386 | $0.08 | 2/2 |
| Coral | 16.6s | 6 | 66,613 | $0.08 | 2/2 | ||
| Delta | -2% | 50% | 25% | -1% | |||
| 5 | What is the priority distribution of issues in our project management tool? How many issues are at each priority level? Which priority has the most issues? Complex Single source | MCP | 119.3s | 33 | 571,711 | $0.54 | 3/3 |
| Coral | 29.1s | 9 | 90,543 | $0.11 | 3/3 | ||
| Delta | -76% | -73% | -84% | -79% | |||
| 6 | What workflow states are issues in across our project management tool? Give me a breakdown of how many issues are in each state. Simple Single source | MCP | 35.2s | 10 | 130,396 | $0.16 | 3/3 |
| Coral | 34.7s | 11 | 107,903 | $0.13 | 3/3 | ||
| Delta | -1% | 10% | -17% | -18% | |||
| 7 | What labels do we use to categorize issues in our project management tool? How many labels are there and what are some notable ones? Simple Single source | MCP | 18.9s | 4 | 51,699 | $0.08 | 2/2 |
| Coral | 35s | 12 | 107,201 | $0.14 | 2/2 | ||
| Delta | 85% | 200% | 107% | 89% | |||
| 8 | Who are the admins in our project management tool? Simple Single source | MCP | 13.4s | 4 | 54,669 | $0.12 | 3/3 |
| Coral | 15.4s | 4 | 69,809 | $0.10 | 3/3 | ||
| Delta | 15% | 0% | 28% | -18% | |||
| 9 | Where do our monitoring alerts get sent when they trigger? What notification channels are configured for our monitors? Complex Single source | MCP | 62.1s | 24 | 716,587 | $0.29 | 2/2 |
| Coral | 22s | 6 | 69,140 | $0.10 | 2/2 | ||
| Delta | -65% | -75% | -90% | -65% | |||
| 10 | What services and namespaces are our monitors watching? I want to understand the scope of our monitoring coverage — what infrastructure and applications are being tracked. Complex Single source | MCP | 58.3s | 23 | 597,064 | $0.30 | 2/2 |
| Coral | 112.8s | 49 | 596,992 | $0.33 | 2/2 | ||
| Delta | 94% | 113% | 0% | 9% | |||
| 11 | Give me a health check of our monitoring system. How many monitors do we have in each status (OK, alerting, warning, etc.)? What percentage of our monitors are currently healthy? Simple Single source | MCP | 20.9s | 5 | 75,495 | $0.11 | 3/3 |
| Coral | 36.2s | 14 | 164,964 | $0.18 | 3/3 | ||
| Delta | 73% | 180% | 119% | 64% | |||
| 12 | Give me a breakdown of our monitoring setup. What types of monitors do we have and how many of each? Are any alerting? Complex Single source | MCP | 74.5s | 34 | 944,822 | $0.32 | 2/2 |
| Coral | 23s | 10 | 108,993 | $0.13 | 2/2 | ||
| Delta | -69% | -71% | -88% | -58% | |||
| 13 | Which projects in our project management tool have target dates set? Are any past their target date or coming up soon? Complex Single source | MCP | 70.8s | 7 | 215,150 | $0.54 | 3/3 |
| Coral | 35.2s | 10 | 108,639 | $0.18 | 3/3 | ||
| Delta | -50% | 43% | -50% | -68% | |||
| 14 | What is the health of our projects in the project management tool? Break down by state — how many are in progress, backlog, completed, etc.? Complex Single source | MCP | 208.2s | 8 | 303,351 | $0.84 | 3/3 |
| Coral | 46.6s | 11 | 99,031 | $0.18 | 3/3 | ||
| Delta | -78% | 38% | -67% | -78% | |||
| 15 | Which projects in our project management tool are completed? List the completed projects. Simple Single source | MCP | 26.3s | 4 | 64,812 | $0.17 | 3/3 |
| Coral | 27.1s | 10 | 107,634 | $0.14 | 3/3 | ||
| Delta | 3% | 150% | 66% | -13% | |||
| 16 | How many unresolved issues does each project have in our error tracking tool? Which project has the most unresolved errors and what are the most frequent ones? Complex Single source | MCP | 46.8s | 11 | 106,735 | $0.22 | 2/2 |
| Coral | 32.6s | 12 | 107,206 | $0.14 | 2/2 | ||
| Delta | -30% | 9% | 0% | -37% | |||
| 17 | Which issues in our error tracking tool affect the most users? Look at the titaness project and tell me which issues have the highest user count. Simple Single source | MCP | 40.1s | 11 | 115,616 | $0.15 | 3/3 |
| Coral | 63.8s | 17 | 229,520 | $0.23 | 3/3 | ||
| Delta | 59% | 55% | 99% | 55% | |||
| 18 | What releases have been deployed in our error tracking tool? List some recent releases and any notable details about them. Complex Single source | MCP | 73.9s | 12 | 200,065 | $0.27 | 3/3 |
| Coral | 58s | 16 | 132,590 | $0.19 | 3/3 | ||
| Delta | -22% | 33% | -34% | -31% | |||
| 19 | What teams exist in our error tracking tool? List them all with their names. Simple Single source | MCP | 20.2s | 7 | 88,178 | $0.13 | 2/2 |
| Coral | 16.4s | 6 | 66,140 | $0.08 | 2/2 | ||
| Delta | -19% | -14% | -25% | -39% | |||
| 20 | Are any teams using sprint cycles in our project management tool? If so, which teams and what are the cycle dates? Simple Single source | MCP | 23.8s | 14 | 72,534 | $0.11 | 3/3 |
| Coral | 21s | 9 | 89,022 | $0.11 | 3/3 | ||
| Delta | -12% | -36% | 23% | -2% | |||
| 21 | Which teams have the most projects in our project management tool? Rank the teams by number of projects. Complex Single source | MCP | 41.2s | 25 | 186,147 | $0.56 | 3/3 |
| Coral | 22.4s | 10 | 91,097 | $0.12 | 3/3 | ||
| Delta | -46% | -60% | -51% | -79% | |||
| 22 | How many users are in our project management tool? List them all with their email addresses. Simple Single source | MCP | 17.1s | 4 | 55,055 | $0.10 | 3/3 |
| Coral | 18.1s | 4 | 69,605 | $0.11 | 3/3 | ||
| Delta | 6% | 0% | 26% | 11% | |||
| 23 | Who has created the most dashboards in our monitoring tool? And are those people still active on the team? Complex Single source | MCP | 97.3s | 44 | 164,292 | $0.82 | 0/3 |
| Coral | 26.6s | 8 | 119,321 | $0.13 | 3/3 | ||
| Delta | -73% | -82% | -27% | -84% | |||
| 24 | Are there any Datadog user accounts with incomplete profile information, like a missing display name? I want to clean up our user directory. Complex Single source | MCP | 166.4s | 56 | 545,394 | $0.86 | 0/1 |
| Coral | 18.2s | 7 | 73,894 | $0.09 | 1/1 | ||
| Delta | -89% | -88% | -86% | -89% | |||
| 25 | I noticed some of our Sentry member profiles might not have proper display names set up. Can you check who has a real name configured versus just showing their email address? Complex Single source | MCP | 118.8s | 40 | 431,404 | $0.64 | 1/2 |
| Coral | 24.4s | 6 | 83,866 | $0.11 | 2/2 | ||
| Delta | -79% | -85% | -81% | -82% | |||
| 26 | What roles do our team members have in Sentry? I want to understand the breakdown of owners, managers, and regular members in our error tracking organization. Complex Single source | MCP | 78.2s | 30 | 309,826 | $0.46 | 2/3 |
| Coral | 24.7s | 6 | 89,120 | $0.11 | 3/3 | ||
| Delta | -68% | -80% | -71% | -75% | |||
| 27 | What teams do we have in our error tracking tool and how big is each one? Complex Single source | MCP | 86.9s | 38 | 751,399 | $0.47 | 2/2 |
| Coral | 21.2s | 7 | 78,921 | $0.11 | 2/2 | ||
| Delta | -76% | -82% | -89% | -76% | |||
| 28 | Do we have any duplicate dashboard names in our monitoring tool? If so, who created them? Complex Single source | MCP | 178.8s | 69 | 191,513 | $0.50 | 1/3 |
| Coral | 29s | 7 | 116,975 | $0.13 | 3/3 | ||
| Delta | -84% | -90% | -39% | -74% | |||
| 29 | Give me an overview of our infrastructure. How many hosts do we have running and what region are they in? Simple Single source | MCP | 38.1s | 7 | 89,795 | $0.12 | 1/2 |
| Coral | 41.9s | 10 | 111,061 | $0.13 | 2/2 | ||
| Delta | 10% | 43% | 24% | 11% | |||
| 30 | Which of our monitors are related to Kubernetes? List them. Simple Single source | MCP | 26.7s | 7 | 74,031 | $0.13 | 3/3 |
| Coral | 27.4s | 8 | 69,533 | $0.10 | 3/3 | ||
| Delta | 3% | 14% | -6% | -20% | |||
| 31 | What label groups do we use to categorize issues in our project management tool? What are the group names? Complex Single source | MCP | 134s | 36 | 949,867 | $0.63 | 3/3 |
| Coral | 27.1s | 6 | 98,375 | $0.13 | 3/3 | ||
| Delta | -80% | -83% | -90% | -80% | |||
| 32 | What projects do we have in our error tracking tool? What platforms are they on? Complex Single source | MCP | 92.1s | 30 | 233,091 | $0.43 | 2/3 |
| Coral | 24.9s | 8 | 83,438 | $0.09 | 3/3 | ||
| Delta | -73% | -73% | -64% | -79% | |||
| 33 | What is the current status of our monitoring alerts? How many are in OK vs alerting states? Do we have any projects or issues in our project management tool related to improving our monitoring or observability setup? Simple Multi source | MCP | 46s | 12 | 156,649 | $0.21 | 3/3 |
| Coral | 56.6s | 16 | 122,775 | $0.19 | 3/3 | ||
| Delta | 23% | 33% | -22% | -10% | |||
| 34 | Which projects have been completed in our project management tool that relate to infrastructure or deployment? Do any of them correspond to things we can see in our monitoring tool, like dashboards or hosts? Simple Multi source | MCP | 59.1s | 9 | 91,379 | $0.24 | 3/3 |
| Coral | 52.2s | 15 | 120,569 | $0.21 | 3/3 | ||
| Delta | -12% | 67% | 32% | -9% | |||
| 35 | Do our project management and error tracking tools have the same team structure? Which teams exist in both, and which are only in one? Simple Multi source | MCP | 23.3s | 9 | 92,062 | $0.11 | 3/3 |
| Coral | 19.7s | 6 | 67,726 | $0.11 | 3/3 | ||
| Delta | -16% | -33% | -26% | 4% | |||
| 36 | Do we have dashboards in our monitoring tool that relate to the applications we track in our error tracking tool? For example, is there a dashboard for our frontend app or titaness? What about Kubernetes infrastructure dashboards? Simple Multi source | MCP | 30.2s | 9 | 97,725 | $0.13 | 3/3 |
| Coral | 30.8s | 11 | 92,102 | $0.14 | 3/3 | ||
| Delta | 2% | 22% | -6% | 5% | |||
| 37 | Which teams in our error tracking tool have the most members, and do those same teams also exist in our project management tool? Which error tracking teams are unique to that tool? Complex Multi source | MCP | 82.3s | 29 | 254,080 | $0.31 | 3/3 |
| Coral | 32.1s | 14 | 108,141 | $0.13 | 3/3 | ||
| Delta | -61% | -52% | -57% | -57% | |||
| 38 | Who were the first people to set up accounts across our tools? I want to know who our original tool administrators or founders were. Complex Multi source | MCP | 94s | 31 | 284,310 | $0.40 | 3/3 |
| Coral | 45.3s | 18 | 153,391 | $0.27 | 3/3 | ||
| Delta | -52% | -42% | -46% | -31% | |||
| 39 | Which of our infrastructure hosts are dedicated to specific applications? Look at our monitoring tool’s host list. Do any of those application names also appear as projects in our error tracking tool, and if so, what platform are they built on? Simple Multi source | MCP | 76.7s | 16 | 172,096 | $0.29 | 3/3 |
| Coral | 32.7s | 9 | 69,836 | $0.11 | 3/3 | ||
| Delta | -57% | -44% | -59% | -60% | |||
| 40 | How do our incident records in our monitoring tool compare to the volume of issues in our error tracking tool? Are we recording incidents proportionally to the errors we’re seeing? Simple Multi source | MCP | 37.2s | 10 | 78,974 | $0.13 | 2/2 |
| Coral | 46s | 14 | 114,949 | $0.16 | 2/2 | ||
| Delta | 24% | 40% | 46% | 20% | |||
| 41 | What strategic initiatives are we tracking in our project management tool? Do any of them relate to dashboards we have in our monitoring tool? For example, is there alignment between our initiatives and what we’re monitoring? Simple Multi source | MCP | 44.4s | 6 | 63,416 | $0.17 | 3/3 |
| Coral | 67.1s | 13 | 192,464 | $0.27 | 3/3 | ||
| Delta | 51% | 117% | 203% | 57% | |||
| 42 | How do we categorize issues across our tools? What labels does our project management tool use for issues, and what severity levels appear in our error tracking tool’s issues? Complex Multi source | MCP | 66.8s | 22 | 204,014 | $0.30 | 3/3 |
| Coral | 54.1s | 15 | 115,685 | $0.16 | 3/3 | ||
| Delta | -19% | -32% | -43% | -47% | |||
| 43 | Both our project management tool and error tracking tool track “issues”. How do the issue counts compare between the two, and how do the issues differ in nature? What do issues represent in each tool? Complex Multi source | MCP | 167.9s | 46 | 908,287 | $0.81 | 3/3 |
| Coral | 53.8s | 15 | 114,316 | $0.15 | 3/3 | ||
| Delta | -68% | -67% | -87% | -81% | |||
| 44 | We use Kubernetes for our infrastructure. What Kubernetes-related monitors do we have in our monitoring tool, and do we have any projects or issues in our project management tool related to Kubernetes deployment or infrastructure? Simple Multi source | MCP | 34.6s | 8 | 61,420 | $0.14 | 2/2 |
| Coral | 60.6s | 11 | 106,711 | $0.21 | 2/2 | ||
| Delta | 75% | 38% | 74% | 48% | |||
| 45 | Do our project management tool and error tracking tool track the same projects? Which project names appear in both, and which are only in one tool? Complex Multi source | MCP | 61.7s | 8 | 125,228 | $0.35 | 2/3 |
| Coral | 39.5s | 13 | 130,735 | $0.16 | 3/3 | ||
| Delta | -36% | 62% | 4% | -55% | |||
| 46 | What types of monitors do we have in our monitoring tool? Break them down by type (e.g. query alert, log alert). Then check our error tracking tool — what platforms are our tracked projects built on? Is our monitoring covering the right technology? Complex Multi source | MCP | 89.8s | 19 | 167,818 | $0.32 | 2/2 |
| Coral | 36.8s | 12 | 110,386 | $0.14 | 2/2 | ||
| Delta | -59% | -37% | -34% | -56% | |||
| 47 | Give me a census of our operational tooling. For each tool we use (project management, monitoring, and error tracking), tell me the key entity counts: teams, projects, issues, monitors, dashboards, hosts — whatever is relevant for that tool. Complex Multi source | MCP | 99s | 35 | 172,925 | $0.63 | 2/3 |
| Coral | 49.8s | 16 | 87,683 | $0.13 | 3/3 | ||
| Delta | -50% | -54% | -49% | -79% | |||
| 48 | How does the breadth of project tracking in our project management tool compare to our error tracking tool? How many projects does each tool have, and what does this tell us about our coverage? Complex Multi source | MCP | 75.4s | 14 | 306,004 | $0.75 | 3/3 |
| Coral | 42.4s | 10 | 111,578 | $0.20 | 3/3 | ||
| Delta | -44% | -29% | -64% | -73% | |||
| 49 | How many teams do we have in our project management tool versus our error tracking tool? Which teams exist only in one tool and not the other? Simple Multi source | MCP | 22.8s | 9 | 92,088 | $0.11 | 3/3 |
| Coral | 17.7s | 6 | 67,332 | $0.08 | 3/3 | ||
| Delta | -22% | -33% | -27% | -22% | |||
| 50 | In our error tracking tool, which teams are the largest and smallest by member count? Do those same teams exist in our project management tool, and if so, how many projects do they own? Complex Multi source | MCP | 172.7s | 53 | 459,067 | $0.75 | 2/3 |
| Coral | 28.4s | 12 | 92,406 | $0.12 | 3/3 | ||
| Delta | -84% | -77% | -80% | -83% | |||
| 51 | What technology stack does our application use? Check what platforms our error tracking projects are built on and what services appear in our monitoring tool’s service catalog. Do the service names match project names? Complex Multi source | MCP | 96.8s | 24 | 211,153 | $0.35 | 3/3 |
| Coral | 140.7s | 43 | 711,979 | $0.61 | 3/3 | ||
| Delta | 45% | 79% | 237% | 76% | |||
| 52 | We deploy an application called “titaness”. What monitoring and error tracking do we have set up for it? Is there a dedicated monitor and what platform does the error tracking project use? Simple Multi source | MCP | 31.4s | 14 | 119,556 | $0.14 | 2/2 |
| Coral | 23s | 10 | 86,370 | $0.11 | 2/2 | ||
| Delta | -27% | -29% | -28% | -24% | |||
| 53 | Our main application is called “titaness”. What services related to titaness show up in our monitoring tool’s service catalog, and what are the top errors it’s generating in our error tracking tool? Simple Multi source | MCP | 51.9s | 13 | 156,588 | $0.18 | 3/3 |
| Coral | 102.1s | 32 | 311,577 | $0.33 | 3/3 | ||
| Delta | 97% | 146% | 99% | 79% | |||
| 54 | How many active team members do we have set up in each of our tools? I want a quick headcount across Datadog, Sentry, and Linear to see if the numbers are consistent. Complex Multi source | MCP | 145.8s | 46 | 218,411 | $0.93 | 1/3 |
| Coral | 55.2s | 10 | 105,850 | $0.19 | 3/3 | ||
| Delta | -62% | -78% | -52% | -79% | |||
| 55 | Are there any team members whose names are spelled or displayed differently across our tools? I want to clean up any inconsistencies in our user directory. Complex Multi source | MCP | 83.1s | 31 | 328,055 | $0.42 | 1/2 |
| Coral | 45s | 12 | 113,692 | $0.16 | 2/2 | ||
| Delta | -46% | -61% | -65% | -61% | |||
| 56 | Do we have any service accounts or bot accounts in our monitoring tool (Datadog)? I want to audit non-human accounts that might have API access. Complex Multi source | MCP | 153.8s | 44 | 382,568 | $1.19 | 0/2 |
| Coral | 33.1s | 6 | 90,789 | $0.12 | 1/2 | ||
| Delta | -78% | -86% | -76% | -90% | |||
| 57 | Do we have any duplicate or shared-email accounts in our monitoring tool? I want to make sure we don’t have redundant accounts that could cause confusion or security issues. Complex Multi source | MCP | 122.9s | 32 | 684,883 | $0.72 | 0/2 |
| Coral | 26.9s | 7 | 103,350 | $0.12 | 1/2 | ||
| Delta | -78% | -78% | -85% | -83% | |||
| 58 | Are there any team members who have been deactivated or are missing from some of our tools but not others? I want to make sure everyone’s access is consistent across platforms. Complex Multi source | MCP | 112.6s | 42 | 335,140 | $0.58 | 1/2 |
| Coral | 52.2s | 10 | 79,708 | $0.17 | 2/2 | ||
| Delta | -54% | -76% | -76% | -71% | |||
| 59 | I need to review our org permissions. Who has elevated access like owner, manager, or admin roles across our tools? Give me a single consolidated list with their roles per tool. Complex Multi source | MCP | 175.3s | 42 | 516,256 | $0.82 | 1/4 |
| Coral | 48.8s | 16 | 130,645 | $0.16 | 4/4 | ||
| Delta | -61% | -62% | -75% | -81% | |||
| 60 | Are there any inconsistencies in who has admin or elevated permissions across our tools? For example, someone who’s an admin in one tool but just a regular member in another. Complex Multi source | MCP | 112s | 41 | 555,916 | $0.60 | 2/2 |
| Coral | 42s | 13 | 112,681 | $0.16 | 2/2 | ||
| Delta | -62% | -68% | -80% | -73% | |||
| 61 | Who are the admins in our project management tool, and are any of them also on a team in our error tracking tool? Complex Multi source | MCP | 141.1s | 43 | 46,047 | $0.48 | 0/3 |
| Coral | 81.3s | 19 | 237,472 | $0.25 | 3/3 | ||
| Delta | -42% | -56% | 416% | -48% | |||
| 62 | Who are the former team members whose accounts have been disabled in Datadog? I want to review our alumni list for compliance records. Complex Multi source | MCP | 99.5s | 37 | 800,622 | $0.51 | 3/3 |
| Coral | 25.3s | 6 | 84,164 | $0.11 | 3/3 | ||
| Delta | -75% | -84% | -89% | -78% | |||
| 63 | We run our infrastructure on Kubernetes in AWS. How many hosts do we have, and does our error tracking tool have a project that matches our main application? What platform is it on? Simple Multi source | MCP | 51.2s | 14 | 169,581 | $0.20 | 2/3 |
| Coral | 30s | 6 | 89,934 | $0.11 | 3/3 | ||
| Delta | -41% | -57% | -47% | -45% | |||
| 64 | How much are we using each of our tools? Give me a quick comparison of the scale — how many issues, projects, dashboards, monitors, etc. across our project management, error tracking, and monitoring tools. Complex Multi source | MCP | 142.8s | 42 | 254,920 | $1.10 | 1/3 |
| Coral | 53.6s | 26 | 70,307 | $0.17 | 3/3 | ||
| Delta | -62% | -38% | -72% | -84% | |||
| 65 | I’m building a new feature on titaness that will process incoming data and write results back. Before I start, what should I be aware of to make sure it fits with the rest of our system? Any active work, known issues, or patterns I should follow? Complex Multi source | MCP | 207s | 55 | 558,570 | $0.80 | 4/4 |
| Coral | 258s | 63 | 634,497 | $0.77 | 3/4 | ||
| Delta | 25% | 15% | 14% | -3% | |||
| 66 | I’m seeing CI failures on my branch of the coral repo. Can you check what CI workflows are configured and whether there are other branches with failing CI? Also, has anyone been discussing build issues in our engineering channel recently? Complex Multi source | MCP | 166s | 65 | 1,013,912 | $0.76 | 0/4 |
| Coral | 240s | 30 | 467,969 | $0.60 | 3/4 | ||
| Delta | 45% | -54% | -54% | -21% | |||
| 67 | We’re about to cut a new release of coral. What’s the latest release version, and are there any open errors or active incidents that should block the release? Complex Multi source | MCP | 164s | 59 | 2,680,766 | $0.84 | 4/4 |
| Coral | 175s | 37 | 737,776 | $0.68 | 4/4 | ||
| Delta | 7% | -37% | -72% | -18% | |||
| 68 | I need to make changes to the coral repo. Who are the most active contributors, and is there a team that formally owns this area? I want to know who to ask for code review. Complex Multi source | MCP | 61s | 23 | 273,197 | $0.34 | 1/4 |
| Coral | 193s | 41 | 905,857 | $0.91 | 3/4 | ||
| Delta | 216% | 78% | 232% | 171% | |||
| 69 | I need to do a quarterly access review. Who has admin or elevated permissions across our tools? Make sure to cover our project tracker, error tracker, and chat workspace. Complex Multi source | MCP | 109s | 43 | 213,159 | $0.44 | 1/4 |
| Coral | 42s | 18 | 141,157 | $0.21 | 3/4 | ||
| Delta | -61% | -58% | -34% | -53% | |||
| 70 | I want to see who’s been most active on our core repos recently. Who are the top contributors to the coral repo, and what projects are they assigned to? Complex Single source | MCP | 66s | 18 | 361,364 | $0.51 | 4/4 |
| Coral | 126s | 24 | 514,279 | $0.59 | 4/4 | ||
| Delta | 91% | 33% | 42% | 17% | |||
| 71 | What strategic initiatives are we tracking, and for each one, can you check if there’s related activity elsewhere in our stack? For example, does the BYOC initiative have a corresponding errors project? Does the Data Ingestion initiative have related dashboards? Complex Multi source | MCP | 104s | 20 | 183,485 | $0.43 | 4/4 |
| Coral | 153s | 36 | 445,860 | $0.62 | 4/4 | ||
| Delta | 47% | 80% | 143% | 42% | |||
| 72 | How’s our software quality looking? I want a cross-tool view: how many unresolved errors do we have per project, what percentage of our monitors are in a healthy state, and how many open bugs are being tracked? Simple Multi source | MCP | 58s | 10 | 103,824 | $0.26 | 3/4 |
| Coral | 100s | 17 | 189,514 | $0.22 | 4/4 | ||
| Delta | 72% | 70% | 83% | -16% | |||
| 73 | I need to understand our team structure across tools. What teams do we have in our different systems, which team names are shared between them, and are there teams that only exist in one place? Simple Multi source | MCP | 31s | 11 | 61,875 | $0.12 | 4/4 |
| Coral | 56s | 24 | 277,255 | $0.34 | 4/4 | ||
| Delta | 81% | 118% | 348% | 193% | |||
| 74 | I just joined and I’m picking up my first feature. It’s about bringing more telemetry data into our backend storage. Before I start, what related work is already in flight, what dashboards cover this area, and what could break that I should watch out for? Complex Multi source | MCP | 250s | 61 | 5,459,803 | $0.86 | 3/4 |
| Coral | 198s | 78 | 1,325,682 | $0.55 | 3/4 | ||
| Delta | -21% | 28% | -76% | -36% | |||
| 75 | I want to know who’s on the team. Can you pull together a people directory from our tools? Who’s in our chat workspace, how are people organized into teams, and who has accounts across our engineering systems? Complex Multi source | MCP | 61s | 33 | 158,286 | $0.30 | 3/4 |
| Coral | 115s | 37 | 403,018 | $0.56 | 4/4 | ||
| Delta | 89% | 12% | 155% | 87% | |||
| 76 | What are the most important things the engineering team is working on right now? What active projects and initiatives do we have, and do any of them map to recent code activity? Which repos are getting the most PRs? Complex Multi source | MCP | 95s | 25 | 313,727 | $0.66 | 3/4 |
| Coral | 798s | 46 | 510,274 | $1.16 | 4/4 | ||
| Delta | 740% | 84% | 63% | 77% | |||
| 77 | I just joined the team. Can you help me understand what services we run? Look at what hosts we’ve deployed, what projects are being tracked for errors, and what’s in our source repos. I want to build a mental model of our architecture. Simple Multi source | MCP | 57s | 12 | 119,704 | $0.23 | 4/5 |
| Coral | 89s | 22 | 267,460 | $0.41 | 5/5 | ||
| Delta | 56% | 83% | 123% | 77% | |||
| 78 | What are the most common things that go wrong in our systems? Look at our top errors, what we monitor and alert on, and any issues labeled as bugs. I want to understand our reliability weak spots. Complex Multi source | MCP | 240s | 61 | 2,498,271 | $0.97 | 4/4 |
| Coral | 239s | 86 | 980,200 | $0.50 | 4/4 | ||
| Delta | 0% | 41% | -61% | -48% | |||
| 79 | An alert just fired. Can you show me all monitors that are currently in an alert or warning state, and check if there are corresponding error spikes in our apps? I also want to know if anything was posted to our #alerts channel about this. Simple Multi source | MCP | 60s | 17 | 173,371 | $0.24 | 3/4 |
| Coral | 107s | 38 | 544,972 | $0.50 | 4/4 | ||
| Delta | 78% | 124% | 214% | 111% | |||
| 80 | Give me a health check of our infrastructure. How many hosts do we have and are they all up? What resource monitors do we have (memory, CPU, disk, pod health)? And are there any infrastructure-related issues being tracked? Complex Multi source | MCP | 70s | 12 | 108,692 | $0.24 | 4/4 |
| Coral | 94s | 27 | 364,710 | $0.38 | 4/4 | ||
| Delta | 34% | 125% | 236% | 59% | |||
| 81 | I’m handing off on-call. Can you give me a status summary? What monitors are configured and what are their current states? Are there any active incidents? What’s the current error landscape — which project has the most unresolved issues? Simple Multi source | MCP | 39s | 11 | 68,726 | $0.16 | 4/4 |
| Coral | 57s | 17 | 163,175 | $0.25 | 4/4 | ||
| Delta | 46% | 55% | 137% | 60% | |||
| 82 | I need to write a postmortem covering our ClickHouse reliability issues. Can you help me gather the facts? Find relevant ClickHouse errors (resolved or active), any dashboards covering ClickHouse, and check if there are tickets tracking fixes. Simple Multi source | MCP | 44s | 11 | 74,281 | $0.18 | 4/4 |
| Coral | 80s | 21 | 233,080 | $0.40 | 4/4 | ||
| Delta | 82% | 91% | 214% | 116% |