Commit Graph

1439 Commits

Author SHA1 Message Date
John R. Daily
901b90ce9a Add new test for down nodes, add more comments 2015-07-16 13:28:36 -04:00
John R. Daily
bd738dea27 Verify error messages by asking for replacements for n=1 chunks 2015-07-16 11:23:26 -04:00
John R. Daily
060563cb89 Ask for very fine-grained plan 2015-07-16 10:36:08 -04:00
John R. Daily
818a3e56ea Start testing traditional coverage with n_val of 4 and 5. Measure resulting plan and compare against ring size 2015-07-16 10:11:49 -04:00
John R. Daily
30d29085f2 Proper matching based on cover context rather than port, bugfix this revealed 2015-07-15 18:05:24 -04:00
John R. Daily
b9b4e1d2ea First test for new parallel coverage queries 2015-07-15 17:23:04 -04:00
Fred Dushin
3fa57b4025 Added a cap to the number of messages to be send asynchronously, to give handoff a chance to complete. 2015-07-07 17:25:49 -04:00
Fred Dushin
fbedd4cd33 Tried to make the verify_handoff_write_once test more predictable 2015-07-07 16:45:05 -04:00
Zeeshan Lakhani
cf0b665854 update yokozuna crdt test from master that works for current 2.0 branch and 2.0.6 release 2015-07-07 13:11:14 -04:00
Zeeshan Lakhani
b68ffbc9be update some yokozuna tests from work done in riak/2.0 branch for 2.0.6 release 2015-07-07 13:01:42 -04:00
Fred Dushin
f330f867b2 Reverted to original verify_handoff_write_once test with modifications
* Backed out the change to riak_core_handoff_sender_intercepts which had
  a modified intercept
* Modified the handle_command intercept to ensure we don't count the
  vnode crashes that were happening before the fix
* Reverted the test to the original, but check to make sure we can read
  all the entries, including the ones we added during handoff
2015-07-02 11:48:20 -04:00
Fred Dushin
06836392cd Re-write of handle_handoff_write_once to test handle_handoff_command.
Still needs some work!
2015-07-02 10:59:01 -04:00
Christopher Meiklejohn
892e8c5744 Use correct shutdown.
No longer use the shutdown command when starting, which causes the use
of the incorrect version of shutdown (shutdown from current, not
previous) which generates an error.

{badrpc,{'EXIT',{{badfun,#Fun<erl_eval.20.80484245>},
                 [{erlang,apply,2,[]},
                  {rpc,'-handle_call_call/6-fun-0-',5,
                       [{file,"rpc.erl"},{line,203}]}]}}}
2015-06-26 00:07:24 -04:00
Christopher S. Meiklejohn
facfaf128c Merge pull request #804 from basho/bugfix/jdm/wait-for-capabilities-to-converge-for-667-mixed
Wait for ring properties to converge before testing new behavior.
2015-06-25 17:13:16 -07:00
Jon Meredith
373f88a0ef Wait for ring properties to converge before testing new behavior. 2015-06-25 17:58:59 -06:00
Zeeshan Lakhani
203efbbce8 add brief module notes 2015-06-25 15:28:28 -07:00
John R. Daily
aa1ab75da4 Pin these tests to 2.0.5 2015-06-25 15:28:22 -07:00
Christopher Meiklejohn
5b22604d0e Remove write_once.
write_once is not present on the 2.0 series of Riak.
2015-06-25 11:25:09 -04:00
Fred Dushin
cff0734aef Fixes for YZ tests against 2.0 2015-06-18 23:24:42 -04:00
Fred Dushin
57346dd366 Merge pull request #794 from basho/zl/clean-up-yz-core-prop-test
adding clarity to test
2015-05-29 16:29:18 -04:00
Zeeshan Lakhani
e86616f38d handle possible race conds w/ trace checker and log out values too for extractor checks 2015-05-27 18:10:30 -04:00
Fred Dushin
a8f00eb46c Removed verify_num_found_query function from yz_extractors and changed test
to use Zeeshan's version by the same name in yokozuna_rt
2015-05-27 16:08:50 -04:00
Zeeshan Lakhani
ad5004c0df *add include file for types
*refactor yokozuna_rt some more
2015-05-27 10:36:18 -04:00
Zeeshan Lakhani
54568a0c27 * rename file to reduce conflict on buildbot runs
* wait_for_index on write data due to use in upgrade tests
* clean-up other tests to that test upgrade from (2.0.5->2.0.6/2.1.1->2.1.2), namely yz_extractors
* clean-up tests that can start using yokozuna_rt helpers
2015-05-27 10:25:05 -04:00
Zeeshan Lakhani
e6e8d9403c * test default bucket type upgrade on change to entropy iteration... and move assert cap checks to rt.erl
* cleaner setup as we deal w/ immediate version needs
2015-05-27 10:25:05 -04:00
Christopher Meiklejohn
08ef4cfb20 Fix typo. 2015-05-20 16:38:34 -04:00
Christopher Meiklejohn
b2942aa436 Remove export all. 2015-05-20 11:30:30 -04:00
Christopher Meiklejohn
d2173bb341 Add separate test for AAE fullsync with bucket types. 2015-05-20 11:29:50 -04:00
Zeeshan Lakhani
3ee669ee24 adding clarity to test 2015-05-18 10:57:49 -04:00
Sean Cribbs
2e7f093532 Hopefully fix the disconnection issue in the riak667_mixed test 2015-05-07 11:42:48 -05:00
Zeeshan Lakhani
1a5204fd18 update yz_handoff test after better handling of default bucket_types in Yokozuna PR #486 2015-05-01 08:35:37 +09:00
Zeeshan Lakhani
96c2f35227 * add some yz wait_until helper fns
* add count_calls and helpers for tracing fn calls (with dbg)
* add yz_extractors test to check for updates made in RIAK-1673
* add yz_noop_extractor intercept for testing a custom mimetype for extractor
2015-04-28 18:00:42 -04:00
Zeeshan Lakhani
61e3170ecf Update core properties test to make sure that indexes can be reindexed after removing of data dirs and segment info files 2015-04-28 17:28:09 -04:00
Doug Rohrer
e8222900bf Merge pull request #789 from basho/bugfix/dr/receive_deadlocks_when_handoff_occurs
Fullsync replication known deadlock tests.
2015-04-28 14:30:16 -04:00
Doug Rohrer
cad958481d Test deadlock between Repl Fullsync and Handoff. Requires riak_repl and riak_kv fixes:
Tests that provoke deadlock causing full_sync to hang when handoff causes vnode to exit before the fold request is started.
2015-04-24 15:23:06 -04:00
Fred Dushin
51998fd555 Back ported branch 'master' into feature/zl/update-yz_crdt-test-for-RIAK-1672
Conflicts:
	src/rt.erl

Manually resolved
2015-04-23 08:56:29 -04:00
Zeeshan Lakhani
758cee1f10 Merge pull request #784 from basho/add-test/zl/no-core-props-unload-working
Add test removing core.properties and checking whether creation works ...
2015-04-16 18:13:05 -04:00
Zeeshan Lakhani
ac7cc9fdd9 * Add test removing core.properties and checking whether creation works without issue
* Related to https://github.com/basho/yokozuna/pull/487
* Removed a possible false-failure case.
* Use yz_rt wait_until, which is PR'ed into a new utils file for r_t in another branch

... caused by disjoint random selections for failure simulation and
recovery testing (allowing an unmolested node to report a pass).

Once we've validated that this fixes it I'll rebase the branch

foo
2015-04-16 17:20:49 -04:00
Nick Marino
2c8a2eb4f8 Merge pull request #785 from basho/nem-ensemble-sync-tickcounts
Fix occasional failures in ensemble_sync
2015-04-16 15:10:02 -04:00
Nick Marino
df438b9406 Fix occasional failures in ensemble_sync
This fixes some rare race conditions in ensemble_sync where we could
sometimes run wait_for_stable prior to an ensemble actually becoming
unstable, and then it would pass the wait but the ensemble could become
unavailable during the next step in the test.

By waiting for the ensemble leader tick counts to increment, we can
guarantee that any failures will have been "noticed" prior to our
calling wait_for_stable, because the leader_tick function ensures a
quorum is present when it executes, and steps down if it fails to get
one.
2015-04-15 18:35:01 -04:00
Andrew J. Stone
39f5573233 Merge pull request #783 from basho/bugfix/dr/fix_wait_for_connection_hang
Remove handling of timeout so pre-2.0 node connections will be detected
2015-04-14 15:22:04 -04:00
Doug Rohrer
da28931e4e Remove handling of timeout, as old-school Riak connection nodes would also result in a timeout. Try new riak_core_cluster_conn:status() first, then fall back to older 1.4.X style bare send. Lather, rinse, repeat. 2015-04-14 15:13:15 -04:00
Russell Brown
88bbcca9ce Remove extraneous and inaccurate comment 2015-04-14 16:04:23 +01:00
Russell Brown
c8af6c6056 Clean up comment typo and poorly named function 2015-04-14 16:03:20 +01:00
Russell Brown
712ea2c4ec Add failing test for github issue riak-727 2015-04-14 15:50:24 +01:00
Andrew J. Stone
4fda0c89f7 Merge pull request #781 from basho/bugfix/dr/fix_wait_for_connection_hang
Fix test hang when riak_core_cluster_conn:status failed to respond after 2 milliseconds.
2015-04-13 16:39:53 -04:00
Doug Rohrer
c603e8be14 Fix test hang when riak_core_cluster_conn:status failed to respond after 2 milliseconds.
Should resolve test failures with a message similar to:

@riak_core_cluster_conn:handle_info:402 Unmatch message {<20563.30238.10>,status}

in the server logs.
2015-04-13 16:37:43 -04:00
Alex Berghage
10d901fb09 Merge pull request #778 from basho/bugfix/zl/yz-handoff-fix-around
Reduce intermittent failure frequency in yz_handoff. Needs to be revisited as part of basho/yokozuna#485
2015-04-13 12:36:28 -05:00
Doug Rohrer
e2dcc1245c Merge pull request #780 from basho/rdb/instrument-repl
Add some extra info out to detect point of timeout in test failures
2015-04-13 13:33:06 -04:00
Zeeshan Lakhani
74b510bf47 update yz_handoff test for swiftness via single leave-state check... need to extend this w/ aggressive settings and more states eventually 2015-04-13 13:17:02 -04:00
John Daily
c89e21631f Merge pull request #755 from basho/bugfix/mw/jmx-verify-timeouts
Added a configurable wait before initial jmx_dump
2015-04-13 11:46:01 -04:00
Russell Brown
e3fece8713 Add some extra info out to detect point of timeout in test failures 2015-04-13 16:09:33 +01:00
Doug Rohrer
786aaec2d3 Merge pull request #776 from basho/bugfix/BTA-186-verify_crdt_capability
BTA-186 verify_crdt_capability
2015-04-13 09:25:19 -04:00
Andrew J. Stone
e1018624ed add debug logging to tests/replication2_connections 2015-04-10 15:49:58 -04:00
Andrew J. Stone
eb2dc7c9ed add some logging so we know which test is failing in replication2_connections. yeesh. 2015-04-09 18:42:57 -04:00
Zeeshan Lakhani
a9a229f3dd *check for map embedded fields and counts,
*increase cluster*,
*add helper to select random node from cluster*
2015-04-09 16:21:57 -04:00
Doug Rohrer
269728e04f Merge pull request #769 from basho/bugfix/BTA-182-repl_bucket_types
Added case clause for case in which we are calling get on a 2.0 or later node
2015-04-08 21:30:51 -04:00
Doug Rohrer
cf510639e8 Merge pull request #772 from basho/bugfix/BTA-183
Bugfix/BTA-183
2015-04-08 19:20:41 -04:00
Doug Rohrer
8f3a7aa7a3 Merge pull request #774 from basho/bugfix/bta-175
Fix race condition in force replace for verify_staged_clusters
2015-04-08 19:17:28 -04:00
Andrew J. Stone
b0bc89e39f Fix race condition in force replace for verify_staged_clusters 2015-04-08 18:58:59 -04:00
Doug Rohrer
fae0fe3bc8 Added wait_until_ready after upgrade, as it appears the HTTP interface
on Previous was not always available as soon as the upgrade was
complete.
2015-04-08 17:34:53 -04:00
Doug Rohrer
13598bc5b4 Merge pull request #771 from basho/bugfix/remove-obsolete-repl_reduced
Removing obsolete repl_reduced test.
2015-04-08 16:33:27 -04:00
Fred Dushin
742cbc24b9 Relaxed the capability check to make it a subset check. 2015-04-08 16:30:52 -04:00
Fred Dushin
aa4ab8405f Fixed expected capabilities on previous node 2015-04-08 15:13:48 -04:00
Fred Dushin
2fa259ea08 Fixed capability tests based on changes to previous and legacy versions against which we are testing. 2015-04-08 15:02:24 -04:00
Doug Rohrer
ad91ce48b4 Removing obsolete repl_reduced test, as the feature for which it was
written will never be shipped. Fixes BTA-167 and perhaps other intermittent failures.
2015-04-08 13:27:30 -04:00
Zeeshan Lakhani
be94f6aa71 * refactor yz_handoff test to use only rtisms
* make test a bit swifter by removing *another* leave/add and a coupel config settings
2015-04-08 09:44:49 -04:00
Fred Dushin
39718e7938 Added case clause for case in which we are calling get on a 2.0 or later node that does not have the bucket type declared. The previous behavior was targeted for pre-2.0 (pre-bucket type) behavior. 2015-04-08 09:28:15 -04:00
Doug Rohrer
19ea9a69bb Merge pull request #765 from basho/bugfix/BTA-172-verify_dt_upgrade
Fix BTA-172.
2015-04-07 09:36:03 -04:00
Sean Cribbs
3c3c7313a2 Fix BTA-172.
Previous to 2.0, the CRDT capability was exactly `[pncounter]`. However,
when upgrading from 2.0 to 2.1, the capability *includes* `pncounter`
but is not exactly equal, resulting in a hang in
`populate_counters/1`. We just want the test to be sure that it is
capable of `pncounter`, not exactly equal, which will make the test
continue on 2.0+ clusters.
2015-04-06 16:21:15 -05:00
Sean Cribbs
53d43d6aa1 Merge pull request #753 from basho/nem-ensemble-byzantine-waitfor-reads
Use rt:waitfor on reads in ensemble-byzantine test
2015-04-06 15:14:27 -05:00
Sean Cribbs
d4f3e569f3 Merge pull request #761 from basho/bugfix/BTA-174-verify_riak_stats
Fix BTA-174
2015-04-06 14:57:13 -05:00
Sean Cribbs
40a583a798 Sort stats names in product_stats/1 2015-04-06 11:25:25 -05:00
Sean Cribbs
f5060e8761 Fix BTA-174
Failure seen at
http://giddyup.basho.com/#/projects/riak_ee/scorecards/120/120-157-verify_riak_stats-centos-6-64/89645

* Typo in one stat name was corrected since
  2.0.5 (search_index_throughtput_one -> search_index_throughput_one)
* write_once code path added its own stats.
* Adjusted assertion at comparison point so that the differences appear
  in the assertion failure as well as the logs.
* Sorted the entries in the list of stats.
* Cleaned up trailing whitespace.
2015-04-06 11:16:14 -05:00
Alex Berghage
5c5931b567 Added explicit wait_for_service riak_pipe on pipe* tests...
... which didn't call the 2-arity version of deploy_nodes. NB
the /2 version implicitly waits on whatever services are specified.
2015-04-06 10:36:06 -05:00
Fred Dushin
b566fe34ed Added a wait for riak_kv to start before polling http interface. 2015-04-03 18:08:36 -04:00
Sean Cribbs
35fe241044 Merge pull request #758 from basho/feature/fd/RIAK-1534
Feature/fd/riak 1534
2015-04-01 18:30:01 -05:00
Fred Dushin
39ce8866be Fixed lingering usage of "fast_path" in overload test 2015-04-01 19:00:53 -04:00
Fred Dushin
d34ff09f8e Updated write_once handoff test to use riak_kv_w1c_put_req_v1 record, and relaxed check in test to avoid potential race condition. 2015-04-01 16:56:36 -04:00
Micah Warren
8b4b8978e6 Added a configurable wait before initial jmx_dump 2015-03-31 13:21:25 -05:00
Fred Dushin
5f547a8364 Name changes. Test is still broken though, due to use of #riak_kv_w1c_put_req_v1 record. 2015-03-27 20:57:09 -04:00
Fred Dushin
57638a86a3 Name change fast_path -> write_once 2015-03-27 20:38:47 -04:00
Fred Dushin
0a1c307bda Merge branch 'feature/fd/RIAK-1534' of github.com:basho/riak_test into feature/fd/RIAK-1534 2015-03-27 16:11:44 -04:00
Fred Dushin
444115aa93 Fixed tests to check improved error handling in put code path. 2015-03-27 16:11:38 -04:00
Andrew J. Stone
44d8d6fd65 support fast path in overload test 2015-03-27 16:00:34 -04:00
Andrew J. Stone
43ffdffc04 add verify_handoff_fastpath 2015-03-26 18:00:53 -04:00
Nick Marino
8e2b99d6e8 Use rt:waitfor on reads in ensemble-byzantine test
There were some race conditions we found in this test where reads
to an ensemble may fail for a brief period of time after recovery. By
using the waitfor function, we ensure that the tests will still pass as
long as things eventually recover and start working properly again.
2015-03-23 13:12:46 -04:00
Cuyler Jones
a1bd419973 Fixed typo; Used exact error message and module name 2015-03-19 14:09:05 -07:00
Cuyler Jones
9f2f3da1c1 Added notes about rare race condition to test 2015-03-19 11:40:45 -07:00
Zeeshan Lakhani
96675d7a49 Merge pull request #745 from basho/zl/remove-wait-for-index-need-in-yz-tests-because-sync
remove the need for wait_for_index on these tests, as we move to sync cr...
2015-03-18 13:32:02 -04:00
Fred Dushin
adb29379d6 Improved testing around timeout conditions. 2015-03-16 21:53:49 -04:00
Ted Burghart
77f2a561b3 Merge pull request #750 from basho/bugfix/reorder-replication2-ssl-tests
Restructured how connectivity tests are configured and run [RIAK-1608].
2015-03-14 07:47:36 -04:00
Brett Hazen
c5259019a3 Merge pull request #749 from basho/bugfix/bch/tweak-pb_security
Remove code:which/1 from pb_security and http_security
2015-03-13 15:39:00 -06:00
Ted Burghart
5c9169fe49 Fixed typos.
Reordered tests to use lighter configurations.
Added test for mismatched ACL.
Added timeout override.
2015-03-13 07:09:05 -04:00
Ted Burghart
ab75a72b43 Restructured how connectivity tests are configured and run [RIAK-1608].
Due to leftover configuration attributes from prior tests within this module (see RIAK-1607), the order of tests has to be managed.  In order to accomplish this and be able to see exactly what configurations are used for each test, configuration blocks have been more directly associated with each connectivity test.
2015-03-12 15:07:36 -04:00
Brett Hazen
41d0be08ac While we re removing code:which/1; also do it for http_security 2015-03-11 21:29:55 -06:00
Brett Hazen
19a8c67c3c Use code:get_object_code/1 instead of code:which/1 in pb_security 2015-03-11 18:13:40 -06:00
Nick Marino
e2c8ec77ec Add xmerl_version to the list of expected stats
This fixes some failures we were seeing with the verify_riak_stats test.
2015-03-11 16:48:22 -04:00
Fred Dushin
1dcf8ccea0 Weakened the test for cluster connectedness to ensure the set of specified nodes are a subset of all erlang:nodes() calls on all hosts (instead of being the same) 2015-03-10 12:17:21 -04:00
Fred Dushin
b0765cd175 Added a test for asynchronous puts.
This test requires creating two clusters in the test.  Because one of the clusters is partitioned during the test, the two nodes must first be partitioned, in order to not confuse any of the healing logic in riak_test.
2015-03-10 11:23:40 -04:00
Zeeshan Lakhani
04d231d078 remove the need for wait_for_index on these tests, as we move to sync creation of search indexes in yz 2015-03-09 14:17:25 -04:00
Fred Dushin
2c788a148b Added test for strict quorum puts 2015-03-09 10:07:19 -04:00
Ted Burghart
452747413f Add test that always passes.
When included in an automatically generated list of tests that may only contain one test, ensures riak_test will continue if that test fails.
2015-03-07 07:57:10 -05:00
Fred Dushin
42716994fe Added tests for primary writes (pw) and "random write wins" scenarios. 2015-03-06 21:51:26 -05:00
Fred Dushin
85bca5a57a cleanup 2015-03-05 20:12:24 -05:00
Zeeshan Lakhani
510431be8f Merge pull request #738 from basho/newtest/zl/yz_handoff
YZ Handoff Riak Test [JIRA: RIAK-1546]
2015-03-04 15:19:32 -05:00
Cuyler Jones
c05aed83f2 Merge pull request #740 from basho/refactor/replication2
Refactor/replication2
2015-03-04 10:37:48 -08:00
Zeeshan Lakhani
d36ec68fa7 handoff test for lost documents using riak_admin to make sure issue uses same env as previously seen issue 2015-03-04 05:42:47 -05:00
Fred Dushin
a36c7b43d3 stub test for immutable puts 2015-03-02 15:37:55 -05:00
Cuyler Jones
ab61070927 Fixed parenthetical spacing and a few other text spacing issues 2015-02-28 21:34:38 -07:00
Cuyler Jones
82d9d57a24 Fixed spacing to adhere to Basho guidelines. 2015-02-28 21:24:59 -07:00
Cuyler Jones
c6d27d5a99 Replaced start up process with build_clusters; Removed some lingering refactoring cruft. 2015-02-25 10:23:33 -08:00
Doug Rohrer
9f116ab088 Merge pull request #737 from basho/feature/support_wildcard_certs_in_tests
Better support for wildcard certificates
- riak_core#701 is now merged, so merging tests.
2015-02-25 11:54:27 -05:00
Cuyler Jones
ef0581cefb Code clean up and small node logging additions 2015-02-24 14:48:58 -08:00
Cuyler Jones
57fde811ef Refactored Replication 2 into compartmentalized tests with documentation. Removed superfluous code. 2015-02-24 14:41:38 -08:00
Doug Rohrer
137a11c690 Address review comments & add more tests
- Change Path -> DirName in make_certs
- Added both more basic and more advanced tests with wildcard certs
2015-02-12 08:40:18 -05:00
Andrew J. Stone
42e7788f9d Merge pull request #734 from basho/bugfix/demystify-overload
Refactor tests/overload to use less magical-number comparisons
2015-02-11 17:57:44 -05:00
Andrew J. Stone
44d18d1a87 Remove magic number for overload test 2015-02-11 17:54:12 -05:00
Doug Rohrer
965522c705 Better support for wildcard certificates:
- Update make_certs to support wildcard certificates by passing an additional Path parameter.
- add replication2_ssl test to ensure using wildcard certificates works after PR to riak_core_ssl_util.
2015-02-11 16:27:39 -05:00
Doug Rohrer
04d54cb810 Change verify_dt_converge to use riakc_flag:enable(F) rather than disable. This will work
even in the rare and pathological case where the cluster is partitioned before all 3 nodes
have received the update. riakc_flag:disable(F) requires context, which isn’t there in the 
new map that would be created on the side of the partition with no data.
2015-02-11 10:47:35 -05:00
Alex Berghage
a9bef3c0a7 Rolled back change to test_vnode_protection threshold.
This test's a little confused in the sources as-is since it prints
like it's based on the number of requests, even though the actual
comparison is done against a function of THRESHOLD. I've reverted
to the comparison used currently, since it looks to me like this
test should really expect to have ~NUM_REQUESTS processes, and a
vnode queue pretty close to THRESHOLD. I'd appreciate review here
though, particularly if anyone recalls the original intent of
these comparison numbers.
2015-02-10 21:15:23 -05:00
Alex Berghage
ec562e9461 Fixed predicate print formatter (was ~w, now ~p) 2015-02-10 21:11:32 -05:00
Alex Berghage
a7258f4ed7 Loosened criteria for successful overload test.
Previously we'd used a sort of fuzzy 'metric' where we expect the
number of successful requests/fsms to be less than some fudge
factor over the overload threshold. This tends to kick up spurious
failures on the test board without offering much more in the way
of assurances about overload's functionality.

This change instead bases test success on the number of requests
only, not the threshold — if some amount of work was shed at all
we consider that a passing test.

In the future we should revisit this and change the request
accounting machinery to just explicitly track denials instead of
fsm processes / vnode queue depth.
2015-02-10 20:49:16 -05:00
Alex Berghage
e0921b1bf1 Reduced repetition in tests/overload predicates 2015-02-10 20:44:09 -05:00
Christopher Meiklejohn
2179361bb3 Resolve incorrect guard. 2015-02-09 16:56:30 -05:00
Ulf Wiger
36f80415e1 add vnode_get_fsm_time_* stats 2015-02-09 22:49:04 +01:00
Russell Brown
9efa9602d0 Merge pull request #728 from basho/kv667
Test for basho/riak#667, read pre 2.0.4 sets/maps
2015-01-29 14:27:50 +00:00
Russell Brown
bc8a3b240b Change env var name 2015-01-27 18:00:48 +00:00
Russell Brown
d22fbfbdc7 Persuade rtdev harness to set advanced.config variables
In the case that no advanced.config file exists (everycase!) rt
would not add any advanced config settings to the conf.

This PR teaches rtdev to create an advanced.comfig file if none exists
so that tests may set advanced config.

In this case we set ring_size and also the `crdt_mixed_versions` app env
2015-01-27 09:53:15 +00:00
Sean Cribbs
fd086aa5bc Finished up format test.
This is currently failing, somehow fetching the map immediately after
updates on the upgrade, I'm getting the dict format.
2015-01-26 15:34:11 -06:00
Sean Cribbs
dea8a89160 Fetch maps after upgrade, checking that they match. 2015-01-24 11:49:18 -06:00
Sean Cribbs
af9fe7a63e WIP extending mixed-cluster test.
Still have not completed upgrade and feature-flag switch.

I changed the versions from atoms to "2.0.2" and "2.0.4", we can
bikeshed that with build/test czars on Monday.

Added some useful logging statements to the plain-upgrade test.

Removed unnecessary clean_cluster and systest_read calls.
2015-01-23 18:02:13 -06:00
Sean Cribbs
86202d010e Little tweak to use literal versions and binaries for keys. 2015-01-23 16:04:48 -06:00
Sean Cribbs
ca040bdaee Begin split of test into two parts, safe upgrade and mixed cluster 2015-01-23 14:09:48 -06:00
Russell Brown
7b4ffcbc18 Extend kv667 test for upgrade between 2.0.2/4/5
This test is getting Biig, and there is still a lot to add
(see comments at the end of the test.)

Maybe we should break it out into a few tests, and there are some
questions still.
2015-01-23 18:02:32 +00:00
Russell Brown
49183d92b9 Update Chris's test to test nested sets and maps too 2015-01-21 16:27:44 +00:00
Christopher Meiklejohn
569a76d490 Add riak_kv#667 reproducer. 2015-01-20 18:09:20 +01:00
Russell Brown
c1b12f5069 Merge pull request #719 from basho/rdb/gh-kv679
Riak Tests for scenarios of basho/riak_kv#679
2015-01-16 13:35:00 +00:00
Russell Brown
ccb5af778b remove test for unfixed case 2015-01-15 17:19:41 +00:00
Russell Brown
3657c57bac Address review comments
Remove those timers and other remnants of flailing when first
trying to write tests.
2015-01-15 16:09:02 +00:00
John Burwell
a37832d7f9 Verify the list of HTTP stats keys is complete
* Checks the list of stats keys returned from the HTTP endpoint
    is complete -- delineating between riak and riak_ee.  The test will
    fail if the list returned from the HTTP endpoint does not exactly match
    the expected list.  This behavior acts as a forcing function to ensure
    that the expected list is properly maintained as stats are added and
    removed.
    * Modifies reset-current-env to properly clean dependencies when a
    full clean is requested and remove the current directory in the
    target test instance.
    * Adds logging to verify_riak_stats to explain the addition steps
    being performed
    * Adds rt:product/1 to determine whether a node is running riak,
    riak_ee, or riak_cs
    * Adds tools.mk support and eunit scaffolding to rebar.config
    * Modifies reset-current-env.sh to remove the current directory in
    the target test instance
2015-01-08 17:23:25 -05:00
John Burwell
f98b3aed87 Merge pull request #713 from uwiger/uw-cmp-http-and-console-stats
Verify that the stats in riak-admin status match those from the HTTP endpoint
2015-01-06 15:02:01 -05:00
Andrew J. Stone
a28f70eb1a Merge pull request #720 from basho/bugfix/ensemble_sync_partition_error
Fix partition issue in ensemble_sync
2015-01-05 16:30:08 -05:00
Andrew J. Stone
70da1fd2f9 separate some folds out of partition/2 in ensemble_sync for clarity 2015-01-05 15:42:26 -05:00
Russell Brown
d86d122308 WIP 2015-01-02 09:02:13 +00:00
Russell Brown
a1af1140f0 WIP data loss test was failing for the wrong reason
add some debugging log out
2015-01-02 09:02:13 +00:00
Russell Brown
8cbec21a5d WIP Kv679 tests
Dataloss at coordinator
Dataloss at coordinator with an old clock repaired
Vnode id's not unique
2015-01-02 09:02:13 +00:00
Russell Brown
87afd498e6 WIP 2015-01-02 09:02:13 +00:00
Russell Brown
6097bda909 WIP kv679 tombstone test A-B-C, crash before reap 2015-01-02 09:02:13 +00:00
Russell Brown
a29d3c3494 WIP kv679 doomstone tests 2015-01-02 09:02:13 +00:00
Russell Brown
4f7e9e17af WIP make the test fail 2015-01-02 09:02:13 +00:00
Russell Brown
40882ea81d WIP remove voodoo code, add inactivity timeout 2015-01-02 09:02:13 +00:00
Russell Brown
aece28adc9 WIP kv679 tombstone-doomstone-boomstone test 2015-01-02 09:02:12 +00:00
Sean Cribbs
d46928bea2 Merge pull request #723 from basho/sdc/bugfix/http-bucket-types-harden
Wrap all list_keys and list_buckets calls with wait_until
2014-12-31 10:35:30 -06:00
John R. Daily
9fdf2ac1f4 Expected output changed as of PR#678 2014-12-31 10:56:47 -05:00
Sean Cribbs
a67140a1b9 Wrap all list_keys and list_buckets calls with wait_until
Because list-keys and list-buckets use coverage, we might hit latent
replicas depending on the coverage plan. This gives each call some extra
tries to complete successfully.
2014-12-31 09:27:23 -06:00
Andrew J. Stone
043baabf3f Fix partition issue in ensemble_sync
It was previously possible for the 'minority' network partition to
become the majority network partition by a naive network partitioning
strategy. Previously, when a preference list of 5 keyspace partitions
was created on only four distinct nodes, it became possible for a 2 node
'minority' network partition group to actually have a majority of
keyspace partitions because 2 keyspace partitions were assigned to 1
node in the 'minority' group. This was fixed so that the 'majority'
group now always has a majority of keyspace partitions by preventing
nodes with greater than 1 keyspace partition from becoming part of the
'minority' group.
2014-12-30 15:21:18 -05:00
Sean Cribbs
41b60d0328 Make sure the httpd started exits with the test.
Now green when run in sequence:

Test Results:
pb_cipher_suites-bitcask: pass
pb_security-bitcask     : pass
---------------------------------------------
0 Tests Failed
2 Tests Passed
That's 100.0% for those keeping score
2014-12-29 14:08:36 -06:00
Andrew J. Stone
1addf5207f Merge pull request #716 from basho/bugfix/ensemble_basic3
Use result of fold in ensemble_basic3
2014-12-24 12:57:52 -05:00
Engel A. Sanchez
0328b4e7d7 Fix failure on slow replication
This changes the test assertion so that it retries fetching the value
from the second cluster until it is the expected value, at which point
the test will either pass if the sibling count is reasonable or fail if
it is too damn high.
2014-12-23 16:41:59 -05:00
Andrew J. Stone
32d90ca13a Don't try to resume pids that don't exist 2014-12-23 15:59:37 -05:00
Andrew J. Stone
7d097080d3 Use result of fold in ensemble_basic3
When resuming vnodes we need the proper pid from the previous suspend.
Use the result of the fold to get the right pids.
2014-12-23 15:50:03 -05:00
Sean Cribbs
abfa0ebf8a Adjust test to ensure that essential concurrency =< n_val 2014-12-23 14:15:55 -06:00
Sean Cribbs
ca44485d1f Fix race condition in verify_dvv_repl.
Fetch the sink object on each iteration of the wait_until, just in case
that the entire set of siblings didn't make it across the repl link.
This also gives read-repair a chance to happen, in case the version the
sink wrote didn't make it to all replicas.
2014-12-23 10:51:50 -06:00
Engel A. Sanchez
556cb7210c Ensure riak_repl service is up across the board
Trying to use the repl features before newly started nodes have
riak_repl completely initialized leads to all sorts of nasty crashes and
noise. Frequently it makes fullsync stuck forever, which makes a lot of
the tests fail.

This also tweaks the AAE fullsync tests to remove assumptions about
failure stats when AAE transient errors occur. The behavior in the
handling of those errors has changed recently with the introduction of
soft exits.
2014-12-18 16:07:00 -05:00
Ulf Wiger
07ec15dc44 check riak-admin status 2014-12-16 20:42:28 +01:00
John Burwell
5a6150ab14 Merge pull request #702 from basho/jsb/reduce-snmp-poll-interval
Reduce the SNMP stats poll interval form 1 minute to 1 second
2014-11-12 13:28:13 -05:00
John Burwell
b103c52a87 Reduces the SNMP stats poll interval from 1 minutes to 1 second
Reduces the probability of a race condition between the calculation of spiral/histogram metrics and SNMP stat cache refresh by reducing the SNMP poll interval to 1 second during test execution
2014-11-12 12:50:57 -05:00
John Burwell
a38166e6ea Configure all JMX listeners to use an unprivledged port the test can be
executed when not running as root.
Logs the JMX query command line to assist test failure debugging
2014-11-11 22:06:22 -05:00
Christopher Meiklejohn
98b66a3190 Resolve race condition.
Don't wait for convergence of the ring, because bucket properties are no
longer stored in the ring; instead, wait until the property changes,
which means the gossip has stabilized.
2014-11-06 12:30:03 -08:00
Micah
696e034ed8 Merge pull request #684 from basho/feature/mw/cluster-mgr-refactor-support
Fixed cluster connection detmination function.
2014-10-30 11:40:43 -05:00
Russell Brown
1f4e504b70 Test for kv679 unique vnode id on a node 2014-10-15 11:29:16 +01:00
Kelly McLaughlin
cf55f195a5 Add option to return exit code from rt:admin calls
Add an rt:admin/3 function that accepts a list of options as the third
parameter.  Currently the only valid option is return_exit_code. The
rtdev, rtssh, and rt_cs_dev harnesses have been updated to support
this option. If return_to_exit is specified the return from a
?HARNESS:admin call is a pair with the exit code as the first member
and the command output as the second member. Finally the
basic_command_line test has been changed to use return_for_exit to
verify the changes.
2014-10-10 15:37:54 -06:00
Micah Warren
d1891f69fd Fixed cluster connection detmination function.
Due to the refactor for the cluster manager/connection manager system to
use otp behaviors, the raw message method of getting stats has been ousted.
Instead, it uses a call. To allow the riak_test to be able to check older
clusters as well as the method, the function was extended to try new and
then the old.
2014-09-24 15:42:26 -05:00
Kelly McLaughlin
7cd2645564 Add verification of handoff heartbeat to verify_handoff test
Add testing of the handoff heartbeat change from the following pull
request: https://github.com/basho/riak_core/pull/560. Add an intercept
module for the riak_core_handoff_sender module to introduce artificial
delay on item visitation during a handoff fold. This delay along with
the changes to the verify_handoff test induces test failure when run
without the heartbeat change. The handoff_receive_timeout is exceeded,
handoff stalls, and the test eventually fails due to timeout. The test
succeeds when run with the heartbeat change.
2014-09-11 15:05:26 -06:00
Russell Brown
45846699c7 Wait until all changes are replicated before passing
Realtime repl takes real time, so wait for it to finish before
calling the test done.
2014-08-27 18:21:00 +01:00
Russell Brown
57cbd61b35 Ensure realtime repl is doing it's thing before making updates 2014-08-27 15:33:10 +01:00
Sean Cribbs
67dfe49934 Merge pull request #654 from basho/sdc-nitpick-counter-cap
Fix nitpick about new/old API from #653.
2014-08-26 09:16:33 -05:00
Andrew J. Stone
a6cb50cdfc use 8 instead of 9 nodes to satisfy giddyup 2014-08-25 18:56:24 -04:00
Kelly McLaughlin
8aa5c3f33f Merge branch 'feature/sc-overload-testing' 2014-08-25 13:58:27 -06:00
Kelly McLaughlin
1253c8ad3e More robustification against races between overloading the vnodes and the list_keys attempt for the coverage testing.
Conflicts:
	tests/overload.erl
2014-08-22 15:58:47 -06:00
Andrew J. Stone
c42d2ac055 Merge pull request #666 from basho/ajs/ensemble_byzantine
Add tests for SC byzantine dataloss and tree loss
2014-08-21 23:17:01 -04:00
Jon Anderson
747212678b Merge pull request #676 from basho/bugfix/jra/bug-611
Add repl_bucket_types check that RTQ is drained after bucket type mis-match
2014-08-21 20:52:29 -04:00
Jon Anderson
613bdc29f1 Change rtq drainage check to use dumpq and put it in a function. 2014-08-20 08:07:26 -04:00
Andrew J. Stone
44f562c2d5 add force_replace test to ensemble_ring_changes 2014-08-19 19:22:58 -04:00
Jon Anderson
f36abd590d Add a check to make sure the RTQ queues drain after an unknown bucket
type repl.
2014-08-19 16:36:45 -04:00
Sean Cribbs
b8d8e3026b Verify that datatype stats are updated.
See basho/riak_kv#1017
2014-08-18 15:24:20 -07:00
Kelly McLaughlin
3577f476e6 Updates to bucket property validation test
* Rename the module from validate_nval_etc to bucket_props_validation
* Employ testing of protocol buffers connections in addition to HTTP
2014-08-13 16:32:34 -06:00
Russell Brown
4e7936da17 Riak 2.0 allow_mult defaults to true 2014-08-13 14:15:25 -06:00
Russell Brown
f100468e11 Test for bad bucket property validation 2014-08-13 14:15:25 -06:00
Andrew J. Stone
07de5cb9c9 Update ensemble_ring_changes w/ node replace test
Additionally fix cluster expansion and make test deterministically pass.
2014-08-13 00:22:02 -04:00
Andrew J. Stone
cee6cbf4ef Fixup ensemble_ring_changes
* Add some logging
* Ensure updates work before and after ring expansion
2014-08-08 17:38:05 -04:00
Andrew J. Stone
d057999a7b WIP - Add ensemble_ring_changes
Ensemble_ring_changes tests writing a value, expanding the cluster, then
updating and reading that value after ring expansion has completed. It
also creates a bucket using a bucket type with a different n_val from
the default bucket type.  The latter tests basho/riak_kv#1008 and it's
corresponding riak_core PR.
2014-08-05 17:23:42 -04:00
Andrew J. Stone
3bed92cfb4 Use rt functions to safely remove backend data
Use riak_test_runner:metadata/0 to get the configured backend instead of
defaulting to bitcask. Additionally we use rt:clean_data_dir/2 to safely
remove backend directories.
2014-07-29 17:05:50 -04:00
Christopher Meiklejohn
ccc4d403d4 Add yz test for search over maps. 2014-07-29 12:33:16 -04:00
Andrew J. Stone
5319e75269 Add tests for SC byzantine dataloss and tree loss
This is the first iteration of creating byzantine dataloss tests that
show both recoverable and unrecoverable, but detectable errors. This tests the
following scenarios.

 * Lose one partition worth of data, but no synctrees and recover.
 * Lose all but one partition of ensemble data, but no synctrees and
   recover.
 * Lose minority of synctrees. Only the peers with the missing
   synctrees are restarted. System remains available.
 * Loss of majority of synctrees. Majority peers are restarted. System
   recovers when they all come back online.
 * Loss of majority of synctrees with one node partitioned. All peers
   restarted except partitioned one. System does not recover with that
   node partitioned. When the partition is healed the system recovers.
 * Loss of all data and synctree except on one peer recovers.
 * Backing up and restoring old data but not synctrees results in
   detected errors. Restoring newer data fixes this.
 * Delete all data on all nodes, but not synctrees. This is detected and
   an error returned to the user.
2014-07-25 13:58:02 -04:00
Jon Anderson
09a60d1289 add timeouts to calls instead of from .riak_test.config 2014-07-24 14:47:46 -04:00
Jon Anderson
9746731f09 initial commit 2014-07-22 17:08:12 -04:00
Jon Anderson
a597b3ee63 removed sleep 2014-07-22 17:03:16 -04:00
Jon Anderson
0671b59b4c add a intercept/server accounting of get_fsm processes. 2014-07-22 16:48:25 -04:00
Kelly McLaughlin
98681cd658 Increase the test code coverage and make the coverage checking more
robust to failures.
2014-07-15 16:48:43 -06:00
Kelly McLaughlin
ceb24fc3e2 Merge pull request #661 from basho/bugfix/replication-ssl-site-ip-verification
Fix failure of replication_ssl test introduced by 297090d
2014-07-15 15:08:50 -06:00
Kelly McLaughlin
0ab2393559 Change replication SSL ACL tests to avoid certificate expiration
Change the ACL test case in the replication_ssl and replication2_ssl
tests to use certificates generated within the tests instead of
relying on certificates created outside the test that are prone to
expire and cause spurious test failure.

Also change the replication_ssl and replication2_ssl tests to avoid a
cycle of standing up the test clusters and then immediately restarting
them before any tests cases execute. This should make the test
execution slightly faster for both test modules.

This commit also changes the tests to be a bit more robust in checking
for cluster state when restarting nodes and removes an unnecessary
five second sleep call in the replication_ssl test.
2014-07-15 12:06:06 -06:00
Kelly McLaughlin
9c5daf0f31 Fix failure of replication_ssl test introduced by 297090d
Change replication_ssl to use the wait_for_site_ips function from the
replication module introduced in
297090ded6 instead of the defunct
verify_site_ips function.
2014-07-14 12:37:32 -06:00
Russell Brown
a212b99a75 Update expected return to match change in riak_pb API
See 2b68a97710
for details.
2014-07-14 17:14:51 +01:00
Joseph Blomstedt
695853cc94 Merge pull request #657 from basho/bugfix/ensemble-interleave-error-failed
Fix ensemble_interleave error condition
2014-07-11 21:42:59 -07:00
Andrew J. Stone
f0643db473 Fix ensemble_sync by allowing {error, <<"failed">>
Allow {error, <<"failed">>} as an error response in ensemble_sync. Fixes
the test with basho/riak_ensemble#37 and basho/riak_kv#1002
2014-07-11 18:11:30 -04:00
Andrew J. Stone
38bd8399d1 Fix ensemble_interleave error condition
Include {error, <<"failed">>} as allowed failure so that test passes
with changes for basho/riak_ensemble#37 and basho/riak_kv#1002
2014-07-11 17:41:48 -04:00
Kelly McLaughlin
b59fb48611 Expand the overload test to include strong consistency
Change the overload test to exercise the strongly consistent code
paths in addition to the eventually consistent paths during overload
conditions.
2014-07-11 13:23:48 -06:00
Kelly McLaughlin
297090ded6 Avoid a race condition in the replication test module
Avoid a race condition in the replication test module when checking
for site IP addresses in the replication status output.  The test
waits for a connection on the leader, but it only queries the
replication status to check for the expected site IP addresses a
single time. Change the test to wait and re-check the status output to
give greater assurance that if the expected site IP addresses are not
present it is due to legitimate failure and not a race condition in
checking the replication status. This change affects the replication
and replication_upgrade tests as well as any other tests that call the
replication:replication function.
2014-07-01 16:36:45 -06:00
Jon Anderson
054c015d10 Merge pull request #651 from basho/feature/jra/verify_listkeys_eqcfsm
Expand verify_listkeys_eqcfsm to track varying buckets and n_vals.
2014-07-01 13:19:41 -04:00
Sean Cribbs
17b94da468 Fix nitpick about new/old API from #653. 2014-07-01 08:53:00 -05:00
Christopher Meiklejohn
f085f70169 Merge pull request #653 from basho/features/csm/crdt-capability
Prevent autoreconnect problem.
2014-06-30 18:29:05 -04:00
Christopher Meiklejohn
da34719fe3 Prevent autoreconnect problem.
Prevent a situtation where the auto-reconnect hasn't triggered yet
causing the result to be an error, instead of ok, on the next operation
after reconnecting.  Force a disconnect and reconnect to make sure the
test is deterministic.
2014-06-30 17:22:20 -04:00
Jon Anderson
7c2d7cc827 Expand verify_listkeys_eqcfsm to track varying buckets and n_vals. 2014-06-26 16:13:58 -04:00
Eric Redmond
0eb2d1c443 Merge pull request #650 from basho/er/yz-ensemble-test
Test that ensemble delete functions in yokozuna
2014-06-23 17:38:09 -07:00
Engel A. Sanchez
3662965705 Merge pull request #649 from basho/feature/ensembles-wait-for-riak-kv
Feature/ensembles wait for riak kv
2014-06-23 14:24:12 -04:00
Eric Redmond
266f9858eb Test the ensemble delete function 2014-06-20 14:44:30 -07:00
Engel A. Sanchez
d32d007f4d Fix service/peer check race
Changing to fetching the list of peers first, then check if the riak_kv
service is up. If the service is up, then check the peers. Otherwise it
is possible to see the service down, then peers up because it went up in
the interim.

Also, making KV vnode delay configurable.
2014-06-20 14:26:24 -04:00
Engel A. Sanchez
3bf0954253 Test ensemble peers wait for riak_kv service
Now ensemble peers are prevented from starting up until the riak_kv
service is up to avoid nasty races that could even lead to node crashes
as the ensembles frantically query for data that isn't ready.
2014-06-19 23:26:30 -04:00
Kelly McLaughlin
4b9a77c828 Re-initiate fullsync after a number of failed checks for completion
Re-initiate fullsync after 100 failed checks for completion. The
number of retries of the 'start fullsync and then check for
completion' cycle is configurable using
repl_util:start_and_wait_until_fullsync_complete/4 and defaults to 20
retries. This change is to avoid spurious test failures due to a rare
condition where the rpc call to start fullsync fails to actually
initiate the fullsync. A very similar changed for the version of the
start_and_wait_until_fullsync_complete in the replication module
introduced in 0a36f9974c has had good
effect at avoiding this condition for v2 replication tests.
2014-06-19 14:34:56 -06:00
Kelly McLaughlin
3466aa7c24 Merge branch 'bugfix/fix-repl-object-reformat' 2014-06-18 22:14:35 -06:00
Jon Anderson
c424848bb0 Merge pull request #644 from basho/bugfix/jra/listkeys_eqc_setup
Fix occasional setup errors in verify_listkeys_eqcfsm
2014-06-18 20:29:00 -05:00
Jon Anderson
f3f5e40a36 Removed commented function. 2014-06-18 18:15:30 -05:00
Kelly McLaughlin
f8e10f2f75 Reinstate concurrency in replication_object_reformat test 2014-06-18 15:55:27 -06:00
Kelly McLaughlin
87ee6f5883 Fix replication_object_reformat test failure
Part of the condition checking done in the replication_object_reformat
test is to validate the results of a fullsync using
repl_util:validate_completed_fullsync/6. The way in which the the
function is called from the test expects fullsync to complete with 0
error_exit or retry_exit conditions occurring. This requires that sink
cluster be in a steady state with all partitions available. The test
failed to wait for such conditions to occur and instead relied on
performing a node downgrade asynchronously and waiting for up to 60
seconds for a completion message before continuing with the test. The
test was continually failing after a node was downgraded to `previous`
due to partitions being reported as `down` on that node. To resolve
the issue the node downgrade process is now done in the primary test
process instead of in a separate spawned process. After the version
downgrade is complete, the test now waits for the riak_repl and the
riak_kv services, calls rt:wait_until_nodes_ready/1, calls
rt:wait_until_no_pending_changes/1, and finally waits for the
riak_repl2_fs_node_reserver named process to be registered on the
downgraded node. This process is responsible for handling partition
reservation requests and is key to determining the the new node is
able to handle a fullsync without partition errors.
2014-06-18 15:55:27 -06:00
Kelly McLaughlin
c55e473b97 Merge branch 'feature/update-repl-systest-read-use' 2014-06-18 15:52:48 -06:00
Kelly McLaughlin
2f9a3cae4a Update calls to rt:systest_read to handle identical siblings
Update the calls to rt:systest_read in repl_util and
repl_aae_fullsync_util to treat identical siblings resulting from the
use of DVV as a single value.  These changes are specifically to
address failures seen in the repl_aae_fullsync_custom_n and
replication_object_reformat tests, but should be generally useful for
replication tests using the utility modules that and that have
allow_mult set to true.
2014-06-18 14:33:44 -06:00
Andrew J. Stone
7d0301db35 add intercept for riak_kv_ensemble_backend:handle_down/4 in ensemble_vnode_crash 2014-06-17 23:13:44 -04:00
Andrew J. Stone
6c14c7c371 Add test to kill a vnode and vnode proxy
Kill a vnode and it's proxy for a given key and ensure that operation
reads succeed afterwards.
2014-06-17 17:57:15 -04:00
Jon Anderson
baf32904af Remove un-used clean up function. 2014-06-17 17:26:23 -04:00
Jon Anderson
8912210036 Re-enable AAE. 2014-06-17 17:04:39 -04:00
Jon Anderson
472241f180 Take cluster set up out of a state and instead put it in the property. 2014-06-17 16:49:20 -04:00
John Burwell
6733c099c8 Merge pull request #636 from basho/bugfix/jsb/start-ensemble-without-aae
Verify Riak Startup when Strong Consistency is Misconfigured
2014-06-16 09:30:33 -04:00
Micah
c96f318f6a Merge pull request #643 from basho/bugfix/mw/better-isolate-pb_security-certs
isolate certs created for the pb_security tests.
2014-06-12 17:30:36 -05:00
Micah Warren
f7631b42c3 pb_cipher_suites test creates certs in its own dir.
Same reason as pb_security and http_security: to keep other tests
from stomping on it.
2014-06-12 17:22:42 -05:00
Micah Warren
f96847beb8 isolate certs created for the pb_security tests.
This should prevent other tests from interfering in its execution
2014-06-12 17:18:15 -05:00
Kelly McLaughlin
0589935931 Fix problems with cert specifications causing replication_ssl to fail
Fix problem with cacertdir specification in replication_ssl test. The
code used load cert files in v2 replication expects the path specific
by the cacertdir key to only be a directory. With v3 replication the
code used is flexible enough to allow a directory or a file. Also
correct a typo in the certfile path for the SSLConfig1 configuration.
2014-06-12 12:38:58 -06:00
Kelly McLaughlin
5f5c3ac035 Merge branch 'bugfix/replication-upgrade-fixes' 2014-06-12 10:39:53 -06:00
Kelly McLaughlin
21b64526f1 Fix two issues with replication_upgrade test
* Do not attempt to cancel fullsync if the initial attempt to start
  and wait for completion fails. It has not been observed that the
  problem is fullsync starting and not completing in time, but rather
  the issue is that the initial call to start fullsync does not take
  effect. Therefore the cancellation is unnecessary.
* Replace the call to repl_util:wait_for_connection/2 in the node
  upgrade process with a call to
  replication:wait_until_connection/1. This function is geared towards
  v2 replication and should speed up test execution.
2014-06-11 21:53:29 -06:00
Micah
2c5def132c Merge pull request #638 from basho/bugfix/mw/pb_security-using-removed-function
Fixed map crdt creation
2014-06-11 13:50:09 -05:00
Micah Warren
3067209a97 Fixed map crdt creation
riakc_map:add/2 no longer exists, so updated the creation of that key to
use the correct update semantics.
2014-06-11 13:25:13 -05:00
Kelly McLaughlin
0e2b52d8b1 Fix timing issue with jmx_verify test
Replace use of a 40 second sleep in the test_supervision test case
with a wait condition to better handle variances in the time it takes
to progress through 10 retry attempts.
2014-06-11 11:26:45 -06:00
Kelly McLaughlin
0601cd594f Merge branch 'bugfix/replication-upgrade-return-term' 2014-06-10 17:14:31 -06:00
John Burwell
6d8c504dba - Verifies Riak startup behavior when strong consistency is enabled and
AAE is disabled.  (defect https://github.com/basho/riak_kv/issues/959)
- Adds additional console output to reset-current-env to explain
  configuration and steps being executed
- Adds the -n option to the reset-current-env script to specify the
  number of nodes to build.  By default, 5 will be created.
2014-06-10 15:01:10 -04:00
Kelly McLaughlin
ba4db5ac74 Change replication upgrade tests to return pass on success
As of commit 3044839456 tests that
return something other than the prescribed success atom 'pass' to
indicate success result in test failure. Change the
replication_upgrade and replication2_upgrade tests that return the
result of the a call to lists:foreach/2 to instead return 'pass' to
indicate success.
2014-06-10 12:58:57 -06:00
Joseph Blomstedt
4322795d8f Merge pull request #632 from basho/jdb-auto-ensemble
Update ensemble tests to work with auto-activation
2014-06-06 16:45:43 -07:00
Engel A. Sanchez
cf10cf96a4 Merge pull request #594 from basho/feature/bitcask-tombstone2-upgrade
Verify Bitcask tombstone 2 upgrade
2014-06-06 10:34:15 -04:00
Joseph Blomstedt
65e15a50ad Update ensemble tests to work with auto-activation
Prior to this commit, the various riak_ensemble related tests would
manually enable the consensus system on one-and-only-one node in a
given cluster in order to work around issue basho/riak_core#571.

This commit changes the tests to work properly after the above issue
has been fixed.

In addition to removing the call to riak_ensemble_manager:enable()
that is now handled automatically by Riak, this commit also removes
a few wait_until_stable/2 checks against 1-node clusters. These
checks no longer apply, since Riak is now designed to only enable
the consensus system after the cluster contains at least 3 nodes.
2014-06-05 16:43:12 -07:00
Christopher Meiklejohn
e07c5eb3f6 Merge pull request #626 from basho/bugfix/bjs/riak_repl_cancelled_connections_cleanup
Disconnect and cancelled connection tests
2014-06-05 16:09:34 -04:00