HCHTech
Well-Known Member
- Reaction score
- 4,185
- Location
- Pittsburgh, PA - USA
I have a small construction business customer. They are out in the sticks a bit, so DSL is their only option. They have a an SBS2011 server and a small Sonicwall. The symptom that generated the call was an inability to get into their payroll website. Other sites worked fine, but the payroll site wouldn't load in any browser - They could go through the login process, but the site would just timeout and you'd get the browser's "check your networking" message.
Everything was working fine until Friday. I couldn't get out there until today, but I had them try a few things, reboot equipment, flush DNS, reset networking, using a public DNS, that kind of stuff. We also found some other sites that wouldn't load - for example the CNN main site would load, but the "Money" site woudn't. This was across all of their computers. I did note that the site would still not load using the IP address, so the problem wasn't DNS.
I remoted in and checked the Sonicwall, rebooted it, upgraded the firmware, nothing helped. They called their DSL vendor and of course - they said their connection was fine. I somehow talked them through configuring a laptop for the PPPoE connection to test things outside of the Sonicwall, and sure enough - all sites loaded fine. Rats - I was hoping to find out it was really their fault.
I tried disabling the various content and packet filtering, but no change. It was getting late and I was getting a headache from all of this so I told him I would come onsite with a replacement firewall today. Their unit would be 5 years old in June, so maybe it's time for a new one anyway.
In the middle of dinner with friends on Saturday, it finally dawns on me that I should test changing the MTU value of the WAN connection.
I go out this morning and sure enough, lowering the MTU value from 1500 (which worked just fine for the 4-and-a-half years that Firewall had been there) to 1492 solved the problem. Now why couldn't I have thought of that in the first place?
So anyway, I thought I'd go through the steps to find the maximum MTU value for a connection in case someone else might benefit.
Pick a website and run a ping as follows:
Ping www.xxxxxxx.com -f -l 1500
-f means "don't fragment the packet". -l 1500 means "use a send buffer of 1500"
Look at the results. If you get "Packet needs to be fragmented but DF set", then you need to decrease the buffer size. If you get normal ping result, then you need to increase the buffer size.
Run the command again either decreasing or increasing the buffer size. You are looking for the maximum buffer size that does not return the "needs to be fragmented" message.
Once you find the maximum buffer size that doesn't fragment, add 28 to this number to account for the packet header and this is your correct MTU value. Adjust the MTU to equal this number in your modem/router/firewall.
I had known about this whole process as a way to optimize your connection, but hadn't really used it much in several years. I certainly didn't have it at the ready for a solution to the "some sites don't load" problem.
I would also note that this whole problem must have come about because the DSL vendor changed something about their connection that resulted in the requirement for a lower MTU setting. You would think tech support might have mentioned that when they called!
I hope this helps someone!
Everything was working fine until Friday. I couldn't get out there until today, but I had them try a few things, reboot equipment, flush DNS, reset networking, using a public DNS, that kind of stuff. We also found some other sites that wouldn't load - for example the CNN main site would load, but the "Money" site woudn't. This was across all of their computers. I did note that the site would still not load using the IP address, so the problem wasn't DNS.
I remoted in and checked the Sonicwall, rebooted it, upgraded the firmware, nothing helped. They called their DSL vendor and of course - they said their connection was fine. I somehow talked them through configuring a laptop for the PPPoE connection to test things outside of the Sonicwall, and sure enough - all sites loaded fine. Rats - I was hoping to find out it was really their fault.
I tried disabling the various content and packet filtering, but no change. It was getting late and I was getting a headache from all of this so I told him I would come onsite with a replacement firewall today. Their unit would be 5 years old in June, so maybe it's time for a new one anyway.
In the middle of dinner with friends on Saturday, it finally dawns on me that I should test changing the MTU value of the WAN connection.
I go out this morning and sure enough, lowering the MTU value from 1500 (which worked just fine for the 4-and-a-half years that Firewall had been there) to 1492 solved the problem. Now why couldn't I have thought of that in the first place?
So anyway, I thought I'd go through the steps to find the maximum MTU value for a connection in case someone else might benefit.
Pick a website and run a ping as follows:
Ping www.xxxxxxx.com -f -l 1500
-f means "don't fragment the packet". -l 1500 means "use a send buffer of 1500"
Look at the results. If you get "Packet needs to be fragmented but DF set", then you need to decrease the buffer size. If you get normal ping result, then you need to increase the buffer size.
Run the command again either decreasing or increasing the buffer size. You are looking for the maximum buffer size that does not return the "needs to be fragmented" message.
Once you find the maximum buffer size that doesn't fragment, add 28 to this number to account for the packet header and this is your correct MTU value. Adjust the MTU to equal this number in your modem/router/firewall.
I had known about this whole process as a way to optimize your connection, but hadn't really used it much in several years. I certainly didn't have it at the ready for a solution to the "some sites don't load" problem.
I would also note that this whole problem must have come about because the DSL vendor changed something about their connection that resulted in the requirement for a lower MTU setting. You would think tech support might have mentioned that when they called!
I hope this helps someone!
Last edited: