
System architecture opinions sought



    ...in the .NET development arena.

    Consider a system with many distributed nodes gathering data that must be sent to a single, central, enterprise-level system ("the core").

    These remote data-gathering nodes can send up to 5 transactions per second each to the core for processing, though the rate is likely to be lower. Each transaction will be a small piece of serialized XML data, probably around 50 bytes.

    There are 1,000 remote nodes in this application, so the potential maximum is 5,000 TPS.

    As a transaction is received at the core, it will be processed (deserialized) and its contents rebroadcast to connected clients (the UI), so that these clients are aware of the data originally sent from a remote node. At most 20 connected clients will be interested in receiving any single transaction broadcast from the core.

    I have already developed a C++/C# IOCP socket server as a Windows service, which I believe will provide the best performance and the highest possible TPS processing capability, but I don't think any single socket server instance will be able to handle 5,000 TPS. I therefore think some load-balancing hardware will be needed up front to spread the transactions received at the core across a number of back-end application servers. This load-balancing hardware will need to be in an active/passive arrangement to avoid a single point of failure at this level, and I think it will also need to be intelligent enough to distribute the transaction load based on the CPU activity of the target app servers.

    Has anyone got any experience of developing a system of this size/type?
    Does anyone have any good suggestions for intelligent load balancing hardware?
    Moving to Montana soon, gonna be a dental floss tycoon

    #2
    Originally posted by TheRefactornator View Post
    These remote data-gathering nodes can send up to 5 transactions per second each to the core for processing, though the rate is likely to be lower. Each transaction will be a small piece of serialized XML data, probably around 50 bytes.
    A few words of advice:
    1) use compact binary data rather than XML
    2) cache transactions on the nodes and send them once a second - one packet with 5 transactions inside it will yield big performance savings (see the sketch below)
    3) use async sockets
    4) preprocess data on the nodes if possible (this way the central server will have less work to do)

    Your most expensive part will be the XML fluff - avoid it. Maybe only push XML to your final 20 clients, but keep to simple binary for the rest.

    SKA is written in .NET and it handles 200+ nodes sending data at a sustained rate of 10 Mbit/s: CPU usage is nearly 0.
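
    A rough sketch of what point 2 might look like on a node (the Transaction type and its WriteTo method are hypothetical stand-ins - see the sketch under #4):

    // Node-side batching: queue transactions as they occur and flush them
    // as a single length-prefixed packet once a second.
    using System.Collections.Generic;
    using System.IO;
    using System.Net.Sockets;
    using System.Threading;

    class NodeSender
    {
        private readonly List<Transaction> _pending = new List<Transaction>();
        private readonly NetworkStream _stream;
        private readonly Timer _timer;

        public NodeSender(NetworkStream stream)
        {
            _stream = stream;
            _timer = new Timer(Flush, null, 1000, 1000); // fire once a second
        }

        // Called by the data-gathering loop for each new transaction.
        public void Enqueue(Transaction t)
        {
            lock (_pending) _pending.Add(t);
        }

        private void Flush(object state)
        {
            Transaction[] batch;
            lock (_pending)
            {
                if (_pending.Count == 0) return;
                batch = _pending.ToArray();
                _pending.Clear();
            }
            BinaryWriter w = new BinaryWriter(_stream);
            w.Write((ushort)batch.Length);                 // count, then the records
            foreach (Transaction t in batch) t.WriteTo(w);
            w.Flush();                                     // one packet instead of five
        }
    }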



      #3
      Originally posted by AtW View Post
      A few words of advice:
      1) use compact binary data rather than XML
      2) cache transactions on the nodes and send them once a second - one packet with 5 transactions inside it will yield big performance savings
      3) use async sockets
      4) preprocess data on the nodes if possible (this way the central server will have less work to do)

      Your most expensive part will be the XML fluff - avoid it. Maybe only push XML to your final 20 clients, but keep to simple binary for the rest.

      SKA is written in .NET and it handles 200+ nodes sending data at a sustained rate of 10 Mbit/s: CPU usage is nearly 0.
      Thanks for the advice. In response:

      1) Do you know of any C# object -> serialize to binary -> deserialize back to object code I can look at? I've always used XML for this type of work in the past, but you're right, it's an overhead I can do without here.
      2) Good advice. I had already considered that this would be a good optimization, and it means the potential 5,000 transactions per second now arrive as at most 1,000 packets per second.
      3) My IOCP sockets approach will be even more efficient than standard .NET async sockets, so I'm happy with this part of the app.
      4) I might be able to do some pre-processing at the data-gathering nodes, but I'm not sure yet what processing power I'll have there. The node's main task will be servicing a number of serial (RS232) data highways, and the platform could well be an embedded device running Windows CE / the Compact Framework, in which case it could be up to the job.
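
      For what it's worth, System.IO.Ports arrived in .NET 2.0, and the shape of the node's serial side could be roughly this (port settings and parsing are invented; check what the Compact Framework build actually supports):

      using System;
      using System.IO.Ports;

      // Minimal sketch of a node servicing one serial highway.
      class SerialNode
      {
          static void Main()
          {
              SerialPort port = new SerialPort("COM1", 9600, Parity.None, 8, StopBits.One);
              port.DataReceived += delegate(object s, SerialDataReceivedEventArgs e)
              {
                  string frame = port.ReadExisting();
                  // parse the frame, detect a delta change, queue a transaction for the core
              };
              port.Open();
              Console.ReadLine(); // keep the process alive in this sketch
          }
      }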

      Any thoughts on load-balancing hardware? I'm aware of the Barracuda 640, which can handle 2,000 TPS and supports a redundant configuration for high availability http://www.barracudanetworks.com/ns/...cer_models.php but I would be interested to learn about other options.
      Moving to Montana soon, gonna be a dental floss tycoon



        #4
        Originally posted by TheRefactornator View Post
        1) Do you know of any C# object -> serialize to binary -> deserialize back to object code I can look at? I've always used XML for this type of work in the past, but you're right, it's an overhead I can do without here.
        Don't use serialization at all - it's too heavy: convenient, but you pay for it in performance.

        Create a class that knows how to write and read its own data from a binary stream - you can't possibly be keeping much data in 50 bytes anyway.
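
        For illustration, something along these lines (the fields are invented):

        using System.IO;

        // A record that knows how to read and write itself - no serializer, no XML.
        class Transaction
        {
            public int NodeId;          // 4 bytes
            public short PointId;       // 2 bytes
            public long TimestampTicks; // 8 bytes
            public double Value;        // 8 bytes -> 22 bytes on the wire

            public void WriteTo(BinaryWriter w)
            {
                w.Write(NodeId);
                w.Write(PointId);
                w.Write(TimestampTicks);
                w.Write(Value);
            }

            public static Transaction ReadFrom(BinaryReader r)
            {
                Transaction t = new Transaction();
                t.NodeId = r.ReadInt32();
                t.PointId = r.ReadInt16();
                t.TimestampTicks = r.ReadInt64();
                t.Value = r.ReadDouble();
                return t;
            }
        }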

        2) Good advice. I had already considered that this would be a good optimization, and it means the potential 5,000 transactions per second now arrive as at most 1,000 packets per second.
        The real size of your packets is likely to be around 1,500 bytes (that Ethernet MTU thingy). Calls, interrupts etc. are all best minimised, so bigger packets are preferable to a high number of packets. It all depends how real-time you want the system to be: for most people 1 second is pretty real time.

        3) My IOCP sockets approach will be even more efficient than standard .NET async sockets, so I'm happy with this part of the app.
        Async .NET sockets already use completion ports - don't try to reinvent the wheel in this area. The first version of SKA used .NET 1.1, and sockets were improved in .NET 2.0 and again in .NET 3.5.
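
        The stock pattern is only a few lines anyway - a bare-bones sketch with no error handling:

        using System;
        using System.Net.Sockets;

        // Begin/EndReceive are serviced by I/O completion ports on Windows
        // without any extra plumbing.
        class Receiver
        {
            private readonly Socket _socket;
            private readonly byte[] _buffer = new byte[8192];

            public Receiver(Socket socket) { _socket = socket; }

            public void Start()
            {
                _socket.BeginReceive(_buffer, 0, _buffer.Length,
                                     SocketFlags.None, OnReceive, null);
            }

            private void OnReceive(IAsyncResult ar)
            {
                int n = _socket.EndReceive(ar);
                if (n == 0) return;   // peer closed the connection
                // hand the n bytes off to a worker thread - never process here
                Start();              // post the next receive
            }
        }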

        Any thoughts on load-balancing hardware? I'm aware of the Barracuda 640, which can handle 2,000 TPS and supports a redundant configuration for high availability http://www.barracudanetworks.com/ns/...cer_models.php but I would be interested to learn about other options.
        Can't help you with that - I write software efficient enough to avoid needing load balancing...

        One more thing - if you intend to do heavy data processing on the main server, make sure you do it in a separate thread with a lower CPU priority than the threads that deal with communications - never do heavy processing in a socket callback.
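
        Roughly this shape, for instance (a sketch; names invented):

        using System.Collections.Generic;
        using System.Threading;

        // Socket callbacks only enqueue; a lower-priority worker does the heavy work.
        class Processor
        {
            private readonly Queue<byte[]> _queue = new Queue<byte[]>();

            public Processor()
            {
                Thread worker = new Thread(Work);
                worker.IsBackground = true;
                worker.Priority = ThreadPriority.BelowNormal; // comms threads keep priority
                worker.Start();
            }

            // Cheap and non-blocking: safe to call from a socket callback.
            public void Post(byte[] packet)
            {
                lock (_queue)
                {
                    _queue.Enqueue(packet);
                    Monitor.Pulse(_queue);
                }
            }

            private void Work()
            {
                while (true)
                {
                    byte[] packet;
                    lock (_queue)
                    {
                        while (_queue.Count == 0) Monitor.Wait(_queue);
                        packet = _queue.Dequeue();
                    }
                    // deserialize here and fan the result out to the clients
                }
            }
        }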

        HTH

        P.S. Avoid using MemoryStream objects - this garbage collection "guarantee" that you won't have memory leaks is a pack of tulips.



          #5
          Originally posted by AtW View Post
          The real size of your packets is likely to be around 1,500 bytes (that Ethernet MTU thingy). Calls, interrupts etc. are all best minimised, so bigger packets are preferable to a high number of packets. It all depends how real-time you want the system to be: for most people 1 second is pretty real time.
          What is your definition of "Realtime", aTw?



            #6
            Originally posted by Churchill View Post
            What is your definition of "Realtime", aTw?
            The actual length of "real-time" depends on the particular situation; in this case 1 second should be pretty real time (even 0.2-second updates would not be real time in the embedded sense).



              #7
              With regard to the data: what happens if data is lost? Do you need guaranteed delivery to the server? Retries? Is the order in which the data is received important?

              Would bog std Windows Server NLB give you the scalability you require?



                #8
                Originally posted by AtW View Post
                The actual length of "real-time" depends on the particular situation; in this case 1 second should be pretty real time (even 0.2-second updates would not be real time in the embedded sense).
                In fact, the system under discussion is a large distributed version of an existing, smaller localized system in which the UI latency actually is 1 second. One second has always been perceived as real-time in this application by both the users and the system vendor.
                Moving to Montana soon, gonna be a dental floss tycoon



                  #9
                  Originally posted by TheRefactornator View Post
                  In fact, the system under discussion is a large distributed version of an existing, smaller localized system in which the UI latency actually is 1 second. One second has always been perceived as real-time in this application by both the users and the system vendor.
                  So it looks like I got the situation right (the meaning of "real-time" varies with context) and our local dawggy got it wrong...



                    #10
                    Originally posted by DimPrawn View Post
                    With regard to the data: what happens if data is lost? Do you need guaranteed delivery to the server? Retries? Is the order in which the data is received important?

                    Would bog std Windows Server NLB give you the scalability you require?
                    It is not too critical if data is lost occasionally, because the data-gathering nodes will be running a loop forever, gathering data from remote field devices over an RS232/485 network, and a transaction will only be created for a delta change before transport to the core. Obviously data loss should be minimal, but if it happens it won't be a disaster. The order in which data is received is not important at all.

                    The serial comms layer will implement its own application-level retry mechanism, but to keep TCP/IP network traffic to a minimum the socket layer is unlikely to be implemented in a request/response arrangement. More likely it will simply be one where the data gatherer packages up a handful of transactions, sends them, and forgets about them until next time.
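
                    If occasional loss really is acceptable and ordering doesn't matter, one way to realise send-and-forget is a single datagram per batch - a sketch (the endpoint is invented, and UDP is an assumption on my part; the thread has otherwise talked TCP):

                    using System.Net;
                    using System.Net.Sockets;

                    // Send-and-forget: one datagram per batch, no ack and no retry here.
                    class BatchSender
                    {
                        private static readonly UdpClient Udp = new UdpClient();
                        private static readonly IPEndPoint Core =
                            new IPEndPoint(IPAddress.Parse("10.0.0.1"), 9000); // invented

                        public static void SendBatch(byte[] packet)
                        {
                            Udp.Send(packet, packet.Length, Core);
                            // the serial comms layer does its own retries upstream
                        }
                    }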

                    I really have no idea about Windows Server NLB and its capabilities, but thanks for the pointer - it's something to consider.
                    Moving to Montana soon, gonna be a dental floss tycoon

