YeOldeStonecat
I have this posted over at the Ubiquiti forums....copy 'n paste here...
I'm leaning towards...some issue with the stacking of the ProCurve switches...maybe someone who lives and breathes enterprise switch setups can chime in (calling netwizz?)
******************************************
So I've had this larger network in place for a few years, it is in a spread out building. It used to have a bunch of those Linksys/Cisco small business series SRW switches...switches uplinked with fiber at a gig, from the wings of the building...to the central area where the "core switch" was. About 2 years ago we put in 17x Unifi APs.
It had run great for a couple of years. In some spots we have the ToughSwitches powering the UAPs..in other spots...a few solo APs just ran from POE injectors. Had 2x SSIDs.
Network was 192.168.9.xxx
They are moving to an IP phone system, adding many more computers, and needing a faster network. So a combination of 2 things happens...
*They have wiring guys come in and add runs, relocate runs
*We come in...remove the Linksys/Cisco SRW switches...and put in HP ProCurve 2900 series (I think they were 2900 series)...a 24 porter at the top of the rack, 2x 48 port models under it,...and the 10 gig switch uplinks across those 3 switches. And in the 3 wings of the building...3x more 48 port models, with the 10 gig fiber uplinks.
Within a day or so...reports of the wireless not working in many areas. We've had them redirected to our cloud controller...which we have about 40x other of our clients on...running fine. This client has been on this cloud controller of ours for ~2 years fine.
Since we switched over to the ProCurves...wired computers..desktops...run great. No problems. But the wireless system...clients just...stop passing traffic. Windows 7 clients get that yellow exclamation mark in the wireless status. Unable to browse/connect to local resources, and unable to browse/connect to the internet.
Sometimes the connection comes back fine. And then..fades back out...
If we reboot the 3x core ProCurves...everything comes back. And it can run fine for a day...or two days...sometimes 3 days. And then the symptoms creep back.
When the wireless symptoms arise...the wired computers are fine.
I set management IPs for the ProCurve switches in the 192.168.168.xxx range.
Routing done at the edge device (Untangle)...
Default VLAN for the production network
When I put the ProCurves in...I added a 3rd SSID, for "Guest"...with the intention of doing that on VLAN 6.
I hadn't gotten to setting up VLAN tagging all the way through on the ProCurves yet...by the time the symptoms started.
DHCP pool has plenty of room...but regardless, I chopped that down from default 8 days..to 4 days..and then to 1 day.
Our Unifi controller was running 4.something firmware...whatever was current last fall. This morning I upgraded to the latest. No help...symptom returned this afternoon.
When the symptom comes up...if I go to a wireless client..here's the weird part. If I ping a public IP, such as an OpenDNS server (208.67.222.222), or Comcast's DNS server (75.75.75.75)...I get replies.
Pings usually reply about 75% of the time. Other times....they time out. And latency tends to be high.
If I ping the gateway, 192.168.9.1....nada.
If I ping one of the local DCs...the DNS servers for the LAN, nada.
Which explains why if I ping a public DNS name...like www.google.com or www.msn.com...it cannot find the host.
I am pinging this from the standard production wireless SSID...which does not have guest policy applied.
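The pattern above (public IPs answer about 75% of the time, while the gateway and local DCs never answer) is easier to track if it's logged the same way each time the symptom shows up. Below is a minimal sketch, not a tested tool: `ping_once`, `summarize`, and `local_only_failure` are hypothetical helper names, the 0.5 threshold is an arbitrary assumption, and the ping flags assume the stock Windows or Linux `ping` command.

```python
import platform
import subprocess


def ping_once(host: str, timeout_s: int = 2) -> bool:
    """Send one ICMP echo with the system ping command; True if it exited 0.

    Assumption: stock Windows or Linux ping. Windows ping can exit 0 on
    "Destination host unreachable", so treat its result as a rough signal.
    """
    if platform.system() == "Windows":
        cmd = ["ping", "-n", "1", "-w", str(timeout_s * 1000), host]
    else:
        cmd = ["ping", "-c", "1", "-W", str(timeout_s), host]
    done = subprocess.run(cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
    return done.returncode == 0


def summarize(samples: dict[str, list[bool]]) -> dict[str, float]:
    """Turn per-host lists of reply/no-reply into a reply rate between 0 and 1."""
    return {h: (sum(r) / len(r) if r else 0.0) for h, r in samples.items()}


def local_only_failure(rates: dict[str, float], local_hosts: list[str],
                       remote_hosts: list[str], threshold: float = 0.5) -> bool:
    """True when every local target is mostly silent while at least one
    remote target still answers (the symptom described in the post)."""
    local_down = all(rates[h] < threshold for h in local_hosts)
    remote_up = any(rates[h] >= threshold for h in remote_hosts)
    return local_down and remote_up
```

To use it from a wireless client, something like `samples = {h: [ping_once(h) for _ in range(20)] for h in ["192.168.9.1", "208.67.222.222"]}` followed by `summarize(samples)` would give the reply rates to compare between a good day and a bad day.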
In the guest network definition, blocked networks...I had added 192.168.9.0/24. I removed that today...symptoms still kept up.
Rebooting the UAPs themselves...usually doesn't do much.
However..reboot the 3x core ProCurves..symptoms go away.
The 3x ProCurves in the server room are "stacked".
I currently have 4x SSIDs set up...2x on the production LAN, and 2x with "guest" policy applied...1 of which is on VLAN 6....it untags on the top ProCurve, which has a patch cable going to a 3rd ethernet interface on the Untangle firewall...which is doing DHCP....for a separate subnet..192.168.10.xxx
But the symptoms were present before setting this up.
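With three address ranges now in play (production 192.168.9.xxx, guest 192.168.10.xxx, switch management 192.168.168.xxx), a quick sanity check that they don't overlap and that a given address lands where expected can rule out one class of mistakes. This is only a sketch using Python's standard `ipaddress` module. The /24 masks on the guest and management ranges are an assumption (only 192.168.9.0/24 is stated in the post), and `overlapping_pairs` and `subnet_of` are hypothetical helper names.

```python
import ipaddress
from itertools import combinations

# Assumption: guest and management are /24s; only production's mask is given.
SUBNETS = {
    "production": ipaddress.ip_network("192.168.9.0/24"),
    "guest_vlan6": ipaddress.ip_network("192.168.10.0/24"),
    "switch_mgmt": ipaddress.ip_network("192.168.168.0/24"),
}


def overlapping_pairs(subnets: dict) -> list:
    """Return the names of any two subnets whose address ranges overlap."""
    return [
        (a, b)
        for (a, net_a), (b, net_b) in combinations(subnets.items(), 2)
        if net_a.overlaps(net_b)
    ]


def subnet_of(ip: str, subnets: dict):
    """Return the name of the subnet containing ip, or None if it is in none."""
    addr = ipaddress.ip_address(ip)
    for name, net in subnets.items():
        if addr in net:
            return name
    return None
```

For example, `subnet_of("192.168.9.1", SUBNETS)` reports the gateway as being in the production range, and an empty list from `overlapping_pairs(SUBNETS)` means the three ranges are disjoint under the assumed masks.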
Rather early on...we turned off STP.
I'm ripping what's left of my hair out over this..it's acting like a loop is happening on the wireless part.
One odd thing I noticed this morning after I upgraded the Unifi controller..and then pushed out the firmware updates...2 of the UAPs gave a message about "must upgrade parent access point first". But all UAPs are connected via ethernet. It's almost like that "mesh" feature is going on..but none of the UAPs show up in the controller as wireless linked. All are wired.
Frustrated. This network ran great on the cheap switches. And we have lots of other clients using ProCurve models...1800 series and a few on the higher series...with Unifi APs all around. No problem with them. We've loved the Unifi APs for years.
Added note....currently (and since the setup 2 years ago)..the UAPs have been on dynamic IPs. But we haven't had a problem letting them be dynamic.