Claims-to-Windows Identity Translation Solutions and "Considerations" when using AD Application Proxy
Problem Statement:
At one of my consulting engagement this year my team were unable to communicate from a claims aware azure web application via client browser to an on-premise, windows authenticated SOAP endpoint. To overcome the identity endpoint mismatch, Claims and Windows, we were using the Azure Application Proxy to perform the identity translation. Our problems primarily occur in communicating between the browser and the Azure Application Proxy. We found multiple potential solutions to solve this problem, but each one has a fatal flaw. We do not need to solve for every solution, rather we only need 1 to work.
Solution 1:
Hide an iframe in the page that authenticates to the proxy by hitting a proxy endpoint and performing the redirect dance. Because the user must first log-in to the application, the iframe can reuse these credentials.
Process Flow Description:
- Iframe makes a request to the proxy endpoint (without authentication)
- Proxy returns a 302 redirect
- Iframe is redirected to AAD login page. Login cookies are submitted to AAD login because application requires authentication.
- Login successful returning token
- Sends token to proxy
- Proxy returns cookie that is valid for the proxy.
- Any future calls to the proxy can use the proxy cookie and make successful calls.
This solution works for the majority of cases except…
Fatal flaw: During step 3, if the user has multiple logins to Azure AD the user can not automatically be logged in because AAD returns an HTML to the hidden iframe asking which to use for login.
Potential fixes:
- Enable home realm discovery (Domain_Hint) for the Application Proxy
- When enabling domain hints, step 2 will return an updated redirect URL to include an extra parameter, ‘&domain_hint=fmi.com’. With this extra information in Step 3, the AAD login page can automatically determine which user to login as. Now the iframe can successfully login and the requests going forward will succeed.
- Blocker: this feature is not available yet for App Proxy.
- Use a Smart Link
- A smart link tells AAD login similar information as the domain hint. I’ve tried using the below smart link, but it does not automatically detect the user domain to use. e.g. https://myapps.microsoft.com/fmi.com/signin/Home%20Proxy%20Dev/000-000-000-000
- Blocker: The Smart link still requires me to choose which user to select.
Solution 2:
Use ADAL.js to retrieve a bearer token for authentication to the Application Proxy endpoint.
- ADAL.js calls AcquireToken to requesting a bearer token for the Application Proxy Endpoint.
- AAD returns an authentication token.
- We make JavaScript calls adding the header “authentication: bearer [token]” so we are properly authenticated to the endpoint.
This solution works for Internet Explorer but in any other browser it fails
Fatal Flaw: When making requests in step 3 with the authentication header, the browser sends a CORs preflight request. The proxy is not handling the OPTIONS request properly and is returning a 302.
Potential Fixes:
- Enable CORs on the Application Proxy so that Preflight requests are handled gracefully.
- Blocker: this feature is not available yet for App Proxy.
Summary:
We communicated these shortcomings of AAD Application Proxy to Microsoft and hope they would prioritize this feature in next release. Hope you would be able to customize your design keeping the above solutions and it's shortcomings in mind.
App Service Plan – Outbound Network Connection Limit
Few days back I ran into a problem where our production azure web apps were throwing below error:
[SocketException (0x271d): An attempt was made to access a socket in a way forbidden by its access permissions x.x.x.x:80] System.Net.Sockets.Socket.DoConnect(EndPoint endPointSnapshot, SocketAddress socketAddress) +208 System.Net.ServicePoint.ConnectSocketInternal(Boolean connectFailure, Socket s4, Socket s6, Socket& socket, IPAddress& address, ConnectSocketState state, IAsyncResult asyncResult, Exception& exception) +464
We opened a case with Microsoft and upon investigation they told us that your App Service Plan (running on Standard S1 2 Instances) are hitting the outbound connection limit. What? How the heck we know that? As of when i am writing, below were the connection limits given my MS.
App Service Plan | Connection Limit |
Free F1 | 250 |
Shared D1 | 250 |
Basic B1 1 Instance | 1920 |
Basic B2 1 Instance | 3968 |
Basic B3 1 Instance | 8064 |
Standard S1 1 Instance | 1920 |
Standard S1 2 Instances | 1920 per instance |
Standard S2 1 Instance | 3968 |
Standard S3 1 Instance | 8064 |
Premium P1 1 Instance (Preview) | 1920 |
On further request, MS gave us a table of apps under the app service place and their open socket connection count. It clearly indicates that Web App 1 worker process is not reusing the connection pool and creating new connections hitting the overall limit of the app service plan.
WebApp Name | Process Name | Open Socket Count |
App2 | <WebJob>.WebJob.exe | 4 |
App2 | <WebJob>.WebJob.exe | 4 |
App2 | w3wp.exe | 2 |
App1 | <WebJob>.WebJob.exe | 8 |
App1 | <WebJob>.WebJob.exe | 4 |
App1 | <WebJob>.WebJob.exe | 6 |
App1 | w3wp.exe | 2 |
App1 | w3wp.exe | 1870 |
App1 | <WebJob>.WebJob.exe | 6 |
App1 | w3wp.exe | 2 |
App1 | <WebJob>.WebJob.exe | 6 |
App3 | w3wp.exe | 4 |
App3 | w3wp.exe | 2 |
Total | 1920 |
With the above data from MS at least you would be able to know where the problem lies and can review the app again.
For your web apps, you can at least review the code (ensuring it doesn't happen to your azure web apps) where you are handing the connection with external entities. Some of the common external dependencies in modern cloud world are:
- SQL - https://azure.microsoft.com/en-us/documentation/articles/sql-database-develop-dotnet-simple/
- Redis - https://azure.microsoft.com/en-us/documentation/articles/cache-dotnet-how-to-use-azure-redis-cache/
- Service Bus - https://azure.microsoft.com/en-us/documentation/articles/service-bus-performance-improvements/
Thanks to the blog http://www.freekpaans.nl/2015/08/starving-outgoing-connections-on-windows-azure-web-sites/ which explains about the same problem.
However the fact is with no monitoring tool available which monitors the open socket count, you will never be able to know the number of open socket connections for your app service plan unless requested from Microsoft.
Troubleshooting automatic restart of Azure Web Jobs
Lately I was working on production issue where the web jobs hosted on azure were automatically restarting by itself or moving in stopped state. There was no signs of user manually restarting it (you can watch those via Activity Logs on your website). So the question is why was it happening? The answer lies in the web job logs. In order to understand it better, I have laid out mostly all reasons of automatic restart with sample logs (you can get the log from either Kudu web job dashboard or directly from storage account).
Reason 1: Due to website shutdown/restart
[10/24/2016 06:53:36 > b3e7a2: SYS INFO] WebJob is stopping due to website shutting down [10/24/2016 06:53:36 > b3e7a2: SYS INFO] Status changed to Stopping [10/24/2016 06:53:39 > b3e7a2: INFO] Job host stopped [10/24/2016 06:53:41 > b3e7a2: ERR ] Thread was being aborted. [10/24/2016 06:53:42 > b3e7a2: SYS INFO] WebJob process was aborted [10/24/2016 06:53:42 > b3e7a2: SYS INFO] Status changed to Stopped [10/24/2016 06:59:17 > 521cd3: SYS INFO] Status changed to Starting [10/24/2016 06:59:20 > 521cd3: SYS INFO] Run script '<YourWebJobName>.WebJob.exe' with script host - 'WindowsScriptHost' [10/24/2016 06:59:20 > 521cd3: SYS INFO] Status changed to Running
Reason 2: Due to changes in the azure web job directory files or file content (D:\home\site\wwwroot\app_data\jobs\continuous\<WebJobName>)
[10/29/2016 00:00:43 > 521cd3: SYS INFO] Detected WebJob file/s were updated, refreshing WebJob [10/29/2016 00:00:43 > 521cd3: SYS INFO] Status changed to Stopping [10/29/2016 00:00:47 > 8d7eea: SYS INFO] Detected WebJob file/s were updated, refreshing WebJob [10/29/2016 00:00:47 > 8d7eea: SYS INFO] Status changed to Stopping [10/29/2016 00:00:48 > 521cd3: INFO] Job host stopped [10/29/2016 00:00:49 > 521cd3: ERR ] Thread was being aborted. [10/29/2016 00:00:51 > 521cd3: SYS INFO] Status changed to Stopped [10/29/2016 00:00:51 > 521cd3: SYS INFO] Status changed to Starting [10/29/2016 00:00:52 > 521cd3: SYS INFO] WebJob process was aborted [10/29/2016 00:00:52 > 521cd3: SYS INFO] Status changed to Stopped [10/29/2016 00:00:52 > 521cd3: SYS INFO] Job directory change detected: Job file 'ApplicationInsights.config' timestamp differs between source and working directories. [10/29/2016 00:00:51 > 8d7eea: INFO] Job host stopped [10/29/2016 00:00:52 > 8d7eea: ERR ] Thread was being aborted. [10/29/2016 00:00:52 > 8d7eea: SYS INFO] WebJob process was aborted [10/29/2016 00:00:52 > 8d7eea: SYS INFO] Status changed to Stopped [10/29/2016 00:00:53 > 8d7eea: SYS INFO] Status changed to Starting [10/29/2016 00:00:54 > 8d7eea: SYS INFO] Job directory change detected: Job file 'ApplicationInsights.config' timestamp differs between source and working directories.[10/29/2016 00:01:01 > 521cd3: SYS INFO] Run script '<YourWebJobName>.WebJob.exe' with script host - 'WindowsScriptHost' [10/29/2016 00:01:01 > 521cd3: SYS INFO] Status changed to Running
Reason 3: Due to a web app and/or web job deployment
Easy to understand since it will trigger Scenario 2
Reason 4: Due to an azure outage or maintenance
Assuming you can rule out reason 1, 3 and 4 (straightforward too) by using standard azure web app monitoring tools (like Failure History and others), 2 would still require further analysis.
Basically Reason 2 indicates that if there is a change in the web job directory i.e. a new file/folder is added or removed automatically or manually, web jobs will restart. Or if the content of the file within the directory is changes, that would also initiate trigger in web job restart.
In that case need to dig further what triggered the directory and/or file content changes within web job directory. You need to ask below question:
- Was there any changes to web app settings via azure portal?
- Was there any runtime SDK upgrades? Like upgrading App Insights or installing an extension
- Are you creating any temporary files at runtime with the web job directory?
You might want to review the application design if 3 is correct.
This helped me fixing my production issue. Hope same for you.
Automating OMS Searches, Schedules, and Alerts in Azure Resource Manager Templates
Recently, I had an opportunity to produce a proof of concept around leveraging Operations Management Suite (OMS) as a centralized point of monitoring, analytics, and alerting. As in any good Azure project, codifying your Azure resources in an Azure Resource Manager template should be table stakes. While the official ARM schema doesn't yet have all the children for the Microsoft.OperationalInsights/workspaces type, the Log Analytics documentation does document samples for interesting use cases that demonstrate data sources and saved searches. Notably absent from this list, however, is anything about the /schedules and /action types. Now, a clever Azure Developer will realize that an ARM template is basically an orchestration and expression wrapper around the REST APIs. In fact, I've yet to see a case where an ARM template supports a type that's not provided by those APIs. With that knowledge and the alert API samples, we can adapt the armclient put <url> examples to an ARM template.
The Resources
What you see as an alert in the OMS portal is actually 2 to 3 distinct resources in the ARM model:
A mandatory schedules type, which defines the period and time window for the alert
{ "name": "[concat(parameters('Workspace-Name'), '/', variables('Search-Name'), '/', variables('Schedule-Name'))]", "type": "Microsoft.OperationalInsights/workspaces/savedSearches/schedules", "apiVersion": "2015-03-20", "location": "[parameters('Workspace-Location')]", "dependsOn": [ "[variables('Search-Id')]" ], "properties": { "Interval": 5, "QueryTimeSpan": 5, "Active": "true" } }
A mandatory actions type, defining the trigger and email notification settings
{ "name": "[concat(parameters('Workspace-Name'), '/', variables('Search-Name'), '/', variables('Schedule-Name'), '/', variables('Alert-Name'))]", "type": "Microsoft.OperationalInsights/workspaces/savedSearches/schedules/actions", "apiVersion": "2015-03-20", "location": "[parameters('Workspace-Location')]", "dependsOn": [ "[variables('Schedule-Id')]" ], "properties": { "Type": "Alert", "Name": "[parameters('Search-DisplayName')]", "Threshold": { "Operator": "gt", "Value": 0 }, "Version": 1 } }
An optional, secondary, actions type, defining the alert's webhook settings
{ "name": "[concat(parameters('Workspace-Name'), '/', variables('Search-Name'), '/', variables('Schedule-Name'), '/', variables('Action-Name'))]", "type": "Microsoft.OperationalInsights/workspaces/savedSearches/schedules/actions", "apiVersion": "2015-03-20", "location": "[parameters('Workspace-Location')]", "dependsOn": [ "[variables('Schedule-Id')]" ], "properties": { "Type": "Webhook", "Name": "[variables('Action-Name')]", "WebhookUri": "[parameters('Action-Uri')]", "CustomPayload": "[parameters('Action-Payload')]", "Version": 1 } }
The Caveats
The full template adds a few eccentricities that are necessary for fully functioning resources.
First, let's talk about the search. At the OMS interface, when you create a search, it is named <Category>|<Name> behind the scenes. It is also lower-cased. Now, the APIs will not downcase any name you send in, but if you use the interface to update that API-deployed search, you will duplicate it rather than being prompted to overwrite. This tells us that for whatever reason, OMS' identifiers are case-sensitive. This is why the template accepts a search category and name in parameters, then use variables to lower them and match the interface's naming convention.
Second, a schedule's name must be globally unique to your workspace, though the actions need not be. Again, the OMS UI will take over here and generate GUIDs for the resources behind the scenes. Unfortunately for us, ARM Templates don't support random number or GUID generation - a necessary evil of being deterministic. While the template could accept a GUID by parameter, I find that asking my users for GUIDs is impolite - a uniquestring() function on the already-unique search name should suffice for most use cases. Since the actions don't require uniqueness, static names will do.
Lastly, this template is not idempotent. Idempotency is a characteristic of a desired state technology that focuses on ensuring the state (in our case, the template), rather than prescribing process (create this, update that, delete the other thing). Each of Azure's types is responsible for ensuring that they implement idempotency as much as possible. For some resources like the database extensions type, which imports .bacpac files, the entire type opts out of idempotency (once data is in the database, no more importing). For others (such as Azure VMs), only changes in certain properties like administrator name/password break idempotency. In the OMS search and schedule's case, if either one exists when you deploy the template, the deployment will fail with a code of BadRequest. To add insult to injury, deleting the saved search from the interface does not delete the schedule. Neither does deleting the alert from the UI - it simply flips the scheduled to "Enabled": false. There is, in fact, no way to delete the schedule via the UI - you must do so via the REST API, similar to the following:
armclient delete /{Schedule ResourceID}?api-version=2015-03-20
This means that, for now, any versioned solution which desires to deploy ARM template updates to its searches and alerts will need to either:
- Delete the existing search/schedule, then redeploy OR
- Deploy the updated version side-by-side (named differently), then clean up the prior version's items.
Luckily ours was a POC, so deleting and re-creating was very much an option. I look forward to days when search and alerting types become first-class, idempotent citizens. Until then, I hope this approach proves useful!
The Template
The template below deploys a search, a schedule, a threshold action (the alert), and a webhook action (the action) that includes a custom JSON payload (remember to escape your JSON!). With a little work, this template could be reworked to segregate the alert-specific components from the search so that multiple alerts could be deployed to the same search or adapted for use with Azure Automation Runbooks. Enjoy!
{ "$schema": "https://schema.management.azure.com/schemas/2015-01-01/deploymentTemplate.json#", "contentVersion": "1.0.0.0", "parameters": { "Action-Payload": { "type": "string" }, "Action-Uri": { "type": "string" }, "Alert-DisplayName": { "type": "string" }, "Search-Category": { "type": "string" }, "Search-DisplayName": { "type": "string" }, "Search-Query": { "type": "string" }, "Workspace-Name": { "type": "string" }, "Workspace-Location": { "type": "string" } }, "variables": { "Action-Name": "webhook", "Alert-Name": "alert", "Schedule-Name": "[uniquestring(variables('Search-Name'))]", "Search-Name": "[toLower(concat(variables('Search-Category'), '|', parameters('Search-DisplayName')))]", "Search-Id": "[resourceId('Microsoft.OperationalInsights/workspaces/savedSearches', parameters('Workspace-Name'), variables('Search-Name'))]", "Schedule-Id": "[resourceId('Microsoft.OperationalInsights/workspaces/savedSearches/schedules', parameters('Workspace-Name'), variables('Search-Name'), variables('Schedule-Name'))]", "Alert-Id": "[resourceId('Microsoft.OperationalInsights/workspaces/savedSearches/schedules/actions', parameters('Workspace-Name'), variables('Search-Name'), variables('Schedule-Name'), variables('Alert-Name'))]", "Action-Id": "[resourceId('Microsoft.OperationalInsights/workspaces/savedSearches/schedules/actions', parameters('Workspace-Name'), variables('Search-Name'), variables('Schedule-Name'), variables('Action-Name'))]" }, "resources": [ { "name": "[concat(parameters('Workspace-Name'), '/', variables('Search-Name'))]", "type": "Microsoft.OperationalInsights/workspaces/savedSearches", "apiVersion": "2015-03-20", "location": "[parameters('Workspace-Location')]", "dependsOn": [], "properties": { "Category": "[parameters('Search-Category')]", "DisplayName": "[parameters('Search-DisplayName')]", "Query": "[parameters('Search-Query')]", "Version": "1" } }, { "name": "[concat(parameters('Workspace-Name'), '/', variables('Search-Name'), '/', variables('Schedule-Name'))]", "type": "Microsoft.OperationalInsights/workspaces/savedSearches/schedules", "apiVersion": "2015-03-20", "location": "[parameters('Workspace-Location')]", "dependsOn": [ "[variables('Search-Id')]" ], "properties": { "Interval": 5, "QueryTimeSpan": 5, "Active": "true" } }, { "name": "[concat(parameters('Workspace-Name'), '/', variables('Search-Name'), '/', variables('Schedule-Name'), '/', variables('Alert-Name'))]", "type": "Microsoft.OperationalInsights/workspaces/savedSearches/schedules/actions", "apiVersion": "2015-03-20", "location": "[parameters('Workspace-Location')]", "dependsOn": [ "[variables('Schedule-Id')]" ], "properties": { "Type": "Alert", "Name": "[parameters('Search-DisplayName')]", "Threshold": { "Operator": "gt", "Value": 0 }, "Version": 1 } }, { "name": "[concat(parameters('Workspace-Name'), '/', variables('Search-Name'), '/', variables('Schedule-Name'), '/', variables('Action-Name'))]", "type": "Microsoft.OperationalInsights/workspaces/savedSearches/schedules/actions", "apiVersion": "2015-03-20", "location": "[parameters('Workspace-Location')]", "dependsOn": [ "[variables('Schedule-Id')]" ], "properties": { "Type": "Webhook", "Name": "[variables('Action-Name')]", "WebhookUri": "[parameters('Action-Uri')]", "CustomPayload": "[parameters('Action-Payload')]", "Version": 1 } } ], "outputs": {} }
Windows Server 2016 RTM and System Center 2016 RTM are now generally available
Windows Server 2016 RTM and System Center 2016 RTM are now live now generally available through the usual sources, including Technet and MSDN. This coincides with the release of a Server 2016 RTM IaaS offering in Azure which is now open to all subscriptions.
Topic Search
-
Securing TLS in WAC (Windows Admin Center) https://t.co/klDc7J7R4G
Posts by Date
- August 2025 1
- March 2025 1
- February 2025 1
- October 2024 1
- August 2024 1
- July 2024 1
- October 2023 1
- September 2023 1
- August 2023 3
- July 2023 1
- June 2023 2
- May 2023 1
- February 2023 3
- January 2023 1
- December 2022 1
- November 2022 3
- October 2022 7
- September 2022 2
- August 2022 4
- July 2022 1
- February 2022 2
- January 2022 1
- October 2021 1
- June 2021 2
- February 2021 1
- December 2020 2
- November 2020 2
- October 2020 1
- September 2020 1
- August 2020 1
- June 2020 1
- May 2020 2
- March 2020 1
- January 2020 2
- December 2019 2
- November 2019 1
- October 2019 7
- June 2019 2
- March 2019 2
- February 2019 1
- December 2018 3
- November 2018 1
- October 2018 4
- September 2018 6
- August 2018 1
- June 2018 1
- April 2018 2
- March 2018 1
- February 2018 3
- January 2018 2
- August 2017 5
- June 2017 2
- May 2017 3
- March 2017 4
- February 2017 4
- December 2016 1
- November 2016 3
- October 2016 3
- September 2016 5
- August 2016 11
- July 2016 13