Arachni RPC running with many bugs.

Posted by Kevin on 19 Jun, 2017 10:29 AM

Hi,

Currently using arachni-1.5-0.5.11.
We have been using the grid mode with the balanced option, but we can never get it running even somewhat stably. Most of the early issues were RAM-related, but those are all fixed now.
Our current setup is 5 servers with 16 GB RAM each, plus a master that also has 16 GB RAM.

We are using arachni_rpc to initiate the scans.

From arachni master:
ruby arachni_rpcd --address 10.20.50.8 --external-address 10.20.50.8 --port 7331 --port-range 17331-27331 --nickname arachni-master --pool-size 1 --pipe-id "Pipe 8" --weight 1000 --reroute-to-logfile

From dispatcher1:
ruby arachni_rpcd --address 10.20.50.10 --external-address 10.20.50.10 --port 7331 --port-range 17331-27331 --nickname arachni-dispatcher-xx1 --pool-size 10 --pipe-id="Pipe 10" --reroute-to-logfile --neighbour arachni-master:7331

From dispatcher2:
ruby arachni_rpcd --address 10.20.50.11 --external-address 10.20.50.11 --port 7331 --port-range 17331-27331 --nickname arachni-dispatcher-xx2 --pool-size 10 --pipe-id="Pipe 11" --reroute-to-logfile --neighbour arachni-master:7331

One thing I noticed is that all our dispatchers have the same neighbour, arachni-master. Would this create a problem?

The command we run from arachni-master:
sudo /opt/arachni/current/bin/arachni_rpc --dispatcher-url=10.20.50.10:7331 --grid --spawns=1 --browser-cluster-ignore-images --scope-auto-redundant=4 --report-save-path=/opt/arachni/reports/testsite.afr --timeout 48:00:00

Is there anything wrong with our setup? All boxes are fully updated Ubuntu.

  1. Posted by Kevin on 19 Jun, 2017 10:31 AM

    One thing I should note that may not be clear:
    One of the problems is that arachni_rpc stops working, and every time you try to start it, it just hangs without doing any actual work.

    Other times it seems as if the connection is lost and the arachni_rpc client is unable to get any reports. Through arachni_rpcd_monitor I can see that the scans are still running.

  2. Posted by Tasos Laskos (Support Staff) on 19 Jun, 2017 03:39 PM

    The way you've set up the Grid, no Dispatcher other than arachni-master will be used, due to the high weight you've assigned to it.
    Also, try removing the --spawns option; it's unstable and will be removed.

  3. Posted by Tasos Laskos (Support Staff) on 19 Jun, 2017 04:38 PM

    My bad, I got it backwards: arachni-master will never be used.

  4. Posted by Kevin on 19 Jun, 2017 09:30 PM

    Thanks. I will try without --spawns. It just stated that I had to specify it.

    Regarding the arachni-master weight: we did that specifically to avoid a heavy load on arachni-master, as it is managing the scans.

  5. Posted by Tasos Laskos (Support Staff) on 19 Jun, 2017 10:07 PM

    Don't specify --grid either, so that you won't have to specify --spawns; the Dispatchers will still load balance the scans amongst themselves.
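
    For example, the client command above would then look roughly like this (same options as before, just without --grid and --spawns):
    sudo /opt/arachni/current/bin/arachni_rpc --dispatcher-url=10.20.50.10:7331 --browser-cluster-ignore-images --scope-auto-redundant=4 --report-save-path=/opt/arachni/reports/testsite.afr --timeout 48:00:00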

    Also, arachni-master isn't managing anything; no one node is more important than the others. Whichever node you ask will search the Grid for the one with the lowest workload score, ask it for an Instance and then pass that information back to you.

  6. Posted by Kevin on 22 Jun, 2017 02:28 PM

    Still very unstable. It is as if arachni_rpc is losing its connection to the dispatcher (after pressing Ctrl+C it will hang forever).
    And then some of the new scans started over RPC will not start.

    I don't know if it is caused by RAM consumption, but in some cases some of our servers also just become completely unresponsive. I just think it's hard to control RAM when it is balancing a lot of scans by itself.

    Do you have any ideas, or any debug info I could provide that would be helpful?

  7. Posted by Tasos Laskos (Support Staff) on 22 Jun, 2017 02:31 PM

    I think you should run fewer scans; it sounds like the servers are having a pretty hard time.
    Out of curiosity, how many scans are you running on these machines?

  8. Posted by Kevin on 23 Jun, 2017 08:01 AM

    Maybe one or two scans each, so around 5-10 scans across 5 servers with good specs.

  9. Posted by Kevin on 23 Jun, 2017 08:24 AM

    Also, when using the rpcd monitor I can see a scan is running, but the arachni_rpc client is dead.
    Can I stop the scan on a dispatcher's rpcd without restarting the whole thing?

  10. Posted by Kevin on 23 Jun, 2017 08:28 AM

    I also noticed that the timeout feature is not working. If I specify 48 hours it does not help and the scan just continues.

  11. Posted by Tasos Laskos (Support Staff) on 23 Jun, 2017 08:35 AM

    1. 2 scans per machine is really low; I can easily run 12 scans, one for each CPU core. Are you sure it's not a network issue? Also, what CPU % are the scans using when things start to lag?
    2. The scan can be killed just like any other process; the monitor should give you the PID (see the example below this list).
    3. If connectivity is lost, like you mentioned, then the timeout won't work, as it's controlled by the client.
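
    For example, a rough sketch (the exact monitor invocation and output may differ on your setup, and <PID> is just a placeholder):
    ruby arachni_rpcd_monitor 10.20.50.10:7331   # should list the running Instances along with their PIDs
    kill <PID>                                   # kill the Instance; kill -9 <PID> if it refuses to die
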
  12. Posted by Kevin on 23 Jun, 2017 12:09 PM

    1. That is good to hear. Around 60% to 80%. I would not describe it as lagging, but rather as the client being unable to contact the dispatcher. Also, some of the boxes are completely dead, meaning I can't even SSH into them. The Google Cloud console shows ~0% CPU at that time though.

    2. Good to know, thanks.

    3. I understand. It is hard for me to believe that it is a network issue though, given that it is built in Google Cloud and the servers are right next to each other.

  13. Posted by Tasos Laskos (Support Staff) on 23 Jun, 2017 12:17 PM

    The boxes being completely dead is worrisome. Can you perform an identical scan and periodically check the number of running processes and disk usage, in addition to RAM and CPU?

    Theoretically, there could be a bug in the way browsers are spawned, leading to basically a fork bomb, or the tmp files Arachni creates to offload work to disk could be taking up all the space.
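
    Something along these lines, run on each dispatcher while a scan is going, would capture those numbers over time (the log path and interval are just suggestions):
    # Append process count, memory and disk usage to a log every 30 seconds:
    while true; do
        { date; ps ax | wc -l; free -m; df -h /; echo; } >> /tmp/arachni-resources.log
        sleep 30
    done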

  14. Posted by Kevin on 23 Jun, 2017 12:23 PM

    Just to make a quick note: by completely dead I mean a hard reset is the only way to recover them.
    I have just started ~20 scans and will monitor RAM and CPU consumption plus the number of running processes.

    The tmp files taking up all the space is definitely a worthy shot. I have had 4 scans running for 10-20 minutes on one of the machines and there is 1.7 GB of disk space left. Will it exceed that?

    Also, thanks for the quick replies. They are greatly appreciated.

  15. Posted by Tasos Laskos (Support Staff) on 23 Jun, 2017 12:29 PM

    Yeah, tmp files can easily exceed 1.7 GB.

    The recommended system requirements state 10 GB of available disk space, and that's per scan -- that's on the very generous side, I'll grant you, but still.

    There are cases where disk usage can grow even past that, and that's a sign of trouble, but it can be mitigated via configuration. We'll cross that bridge when we come to it though.
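
    If you want to see how much space the scan scratch data is actually taking while a scan runs, something like this should show it (assuming the default system temp location; the actual directory names will vary):
    du -sh /tmp/* 2>/dev/null | sort -h | tail -n 10   # largest entries under /tmp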

  16. Posted by Kevin on 23 Jun, 2017 12:34 PM

    So for now, would it be okay to up the servers to 40 GB of disk space each in order to run 4 scans per server?

  17. Posted by Tasos Laskos (Support Staff) on 23 Jun, 2017 12:35 PM

    Yep, give that a shot and see if it makes a difference.

  18. Posted by Kevin on 26 Jun, 2017 09:08 AM

    I tried increasing all the disks to 40 GB and ran 5 scans per server, with 4 cores each. Now, Monday morning, I am unable to contact any of the 5 servers.

    As they are in Google Cloud I cannot currently see their disk or RAM usage, but I'm assuming that disk errors are the problem.

    Do you think I pushed them too hard with a total of 30 scans?

  19. Posted by Tasos Laskos (Support Staff) on 26 Jun, 2017 09:15 AM

    Yeah, better to stick with one scan per core.
    Also, while the scans are running, can you try watch -n1 df and watch -n1 free over SSH? At the point where it gets stuck we'll know how things look resource-wise.
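
    If it's easier, both can go in a single terminal, e.g.:
    watch -n1 'df -h; free -m'   # refresh disk and memory usage every second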

  20. Posted by Kevin on 26 Jun, 2017 10:56 AM

                  total        used        free      shared  buff/cache   available
    Mem:       15400392    14751468      518416        8768      130508      398496
    Swap:             0           0           0

    I managed to SSH into one of them again, so they are not completely dead. Free memory is close to zero though.
    I will try to monitor disk and memory with only 1 scan per core.

    I can see that arachni_rpc no longer produces any output, but the dispatcher is still scanning. Can I get the reports somewhere when it finishes, or are they lost?
